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Abstract 



Earth was formed about 4.54 billion years ago. In less than a billion years the first 
life forms appeared on a still very 'young' earth. Even the simplest cells today 
are extremely complicated and require several components to work coherently. It 
is hard to imagine how such complexity, as seen in life, would have arisen at the 
origin of life. Several approaches, experimental as well as theoretical, are being 
used to explain the origin(s) of life, but a comprehensive explanation still eludes 
us. 

Miller's experiment in 1953 was one of the first successful attempts to show 
that some of the precursors of life, amino acids, can be produced in simulated 
pre-biotic environments. Recent experiments have also presented a possible way 
of producing ribonucleotides in prebiotically plausible conditions. It has been 
thus far not possible to show how these precursors can combine or self-organize 
to form large molecules that are needed for life to function. Large molecules 
such as proteins and nucleic acids are crucial for life. The production of these 
large molecules, as we know it today, requires good catalysts, and the only good 
catalysts we know that can accomplish this task consist of large molecules. There 
exists no other natural process we know of that produces large molecules and that 
does not itself use large molecules. Thus the origin of large molecules is a 'chicken 
and egg problem' in chemistry. We expect that the answer to the question lies in 
the processes that occurred before life originated. 

In this thesis we present a mechanism, based on autocatalytic sets (ACSs), that 
is a possible solution to this problem. We present a mathematical model describing 
the population dynamics of molecules in an artificial but prebiotically plausible 
chemistry in a well stirred chemical reactor. In the model large molecules can in 
principle be produced by successive ligations of pairs of smaller molecules. The 
smallest molecules in the chemistry compose the 'food set', whose concentrations 
are considered to be buffered. The chemistry contains a large number of sponta- 
neous reactions of which a small subset could be catalyzed by molecules produced 
in the chemistry with varying catalytic strengths. Normally the concentrations of 



large molecules in such a scenario are very small, diminishing exponentially with 
their size. ACSs, if present in the catalytic network, can focus the resources of 
the system into a sparse set of molecules. ACSs can produce a bistability in the 
population dynamics and, in particular, steady states wherein the ACS molecules 
dominate the population, i.e., have higher concentrations compared to the rest of 
molecules in the chemistry (background). 

In this thesis we attempt to address two main questions: First, under what 
circumstances do molecules belonging to the ACSs dominate over the background, 
and second, starting from an initial condition that does not contain good cata- 
lysts, can a sparse set of large molecules (containing several tens or a few hundred 
monomers) that are good catalysts arise and be maintained in the reactor at con- 
centrations significantly above the background? We find that the mere existence 
of ACSs in the chemistry at the level of network topology is not sufficient to guar- 
antee their domination at the population level (as also reported by Bagley and 
Farmer in their model). We further quantitatively characterize the interplay be- 
tween ACS topology and rate constants including catalytic strengths, dissipation 
rate, etc., that results in ACS domination. We show that for ACS molecules to 
dominate above the background, the catalytic strength needs to be sufficiently 
large, growing exponentially with the size of the catalyst molecule. 

We present a possible resolution to this problem via a hierarchy of nested 
ACSs. We show that if an ACS catalyzed by large molecules contains within it (or 
partially overlaps with) a smaller ACS catalyzed by smaller molecules (we refer to 
this as a 'nested ACS' structure), the catalytic strength required for the large ACS 
to dominate comes down significantly. Effectively the small ACS reinforces the 
larger one. We construct a cascade of nested ACSs with each successive and larger 
ACS in the cascade containing (a part of) the smaller ACSs, thereby receiving 
reinforcement from them and in turn providing reinforcement to the next larger 
ACS in the cascade. We show that when the network contains a cascade of nested 
ACSs with the catalytic strengths of molecules increasing gradually with their 
size (e.g., as a power law), a sparse subset of molecules including some very large 
molecules can come to dominate the system. We exhibit an example in which a 
large catalyst (with more than 400 monomers) arises with a significant population 
in the steady state starting from only monomers in the initial state. 

Thus our work presents a possible resolution to the chicken and egg problem 
in chemistry posited by the existence of large molecules. It suggests an incre- 
mental bootstrap mechanism - the existence of a hierarchy of nested ACSs in the 



chemistry - through which a sparse subset of large molecules can naturally arise 
starting from plausible prebiotic initial conditions. 
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Chapter 1 



Introduction 



"/t is often said that all the conditions for the first production of a 
living organism are now present, which could have ever been present. 
But if (and oh! what a big if!) we could conceive in some warm pond, 
with all sorts of ammonia and phosphoric salts, light, heat, electricity, 
etc., present, that a protein compound was chemically formed ready 
to undergo still more complex changes, at the present day such matter 
would be instantly devoured or absorbed, which would not have been the 
case before living creatures were formed." 

- Charles Darwin (1809-1882) 

EARTH is special in that it is the only known planet that supports life. Yet, life 
is ubiquitous on Earth. Within a rather short period after its creation, life 
emerged on Earth. Over the course of 3.5 - 4 Gyr (gigayears) life has evolved to 
its current form. While biologists have made great progress in understanding how 
life 'works', there has not been an equal progress on the question of life's origin. 
This thesis is an attempt to formulate and answer one of the many questions 
that relate to the problem of origin of life on Earth. More concretely we discuss, 
with the help of a mathematical model of primordial chemistries, the set of initial 
bootstrapping processes that could have led to the production of large molecules 
that were needed for the emergence of life. In this chapter, we discuss some of the 
facts known about life and the some of the questions that relate to its origin. 

1.1 The Earth's history 

The age of the Earth is approximated to be 4.54 billion years (±1%) fl]-[3]. Fig. |1.1| 
illustrates the major events in the history of Earth since its birth. Formed from a 
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gas cloud made up of hydrogen and helium and interstellar dust under intense heat 
and pressure, at its birth Earth was a molten planet. Over the next few hundred 
million years, it cooled to form a solid crust ^47]. As it cooled further, clouds 
formed and rains created the ocean. It is also believed that part of the water that 
exists on Earth was delivered by impacting comets that contained ice. There are 
evidences to suggest that oceans formed on Earth between 4.4 - 4.2 Ga (gigayears 
ago) [iljSlJQ]. (It was earlier believed that it took much longer for Earth to cool 
enough to be able to support liquid water.) The Earth's atmosphere contained 



water vapour, carbon dioxide, nitrogen and smaller amounts of other gases 10 



It is estimated that in about 600 million years (around 3.8 Ga) first life forms 



appeared 1 1 . The earliest living organisms were microscopic bacteria, which show 



up in the fossil record as early as 3.5 Ga |12-16 . Some other studies suggest that 
life could have been present even earlier (as early as 4.25 Ga) 17 19 . The earliest 
life forms were chemoautotrophic, i.e., they relied on chemical sources for energy. 
Later, photosynthesis was adopted as a mechanism for energy production [20|. 
Photosynthetic prokaryotic organisms emitted oxygen as a waste product [21j. 
This is believed to have caused the 'great oxygenation event' at 2.45 Ga leading to 



release of oxygen into the atmosphere 22-26] (see Fig. 1.2 ). There are studies that 



suggest alternate mechanisms for oxygenation of Earth's atmosphere 27,28 . The 
presence of oxygen provided life with new opportunities. Organisms switched to 
aerobic metabolism, which is more efficient than anaerobic pathways. Multicellular 
life appeared soon after oxygen became available. As the oxygen level rose further 
life became more diversified and led to the 'Cambrian explosion' around 530 Ma 
(million years ago). Humans emerged on Earth only 2.2 Ma. 

1.2 Life 

Life is one of the most complex phenomena known to human kind. Life involves 
several intricate processes, such as metabolism, genetic regulation, formation of 
physical structures, etc., that work in unison to allow organisms to harness energy, 
repair themselves, grow and replicate. Each of these processes is indispensable for 
life, and depends upon other processes to be able to successfully perform its task. 

1.2.1 Life: An organized system 

Cells are the fundamental units of life. Cells respond to the environment, conduct 
internal processes, grow and replicate with variation. A cell contain several small 
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ca. 380 Ma: 
First vertebrate land i 
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Figure 1.1: Earth's geological time 'clock'. Earth began to form between 4.6 - 4.5 Ga 
(gigayears ago) from a cloud of gas (mostly hydrogen and helium) and interstellar dust. It was 
formed under intense heat and pressure and was initially a molten planet. During the Hadean 
Period (up to nearly the first billion years of formation) , the Earth was bombarded continuously 
by asteroids, meteors and comets. By 3.8 to 4.2 Ga, the Earth cooled and became a planet with 
an atmosphere and an ocean. Earth's atmosphere was formed mostly from water vapour, carbon 
dioxide, nitrogen, and small amounts of carbon monoxide, methane, ammonia, hydrochloric acid 
and sulfur produced by the constant volcanic eruptions. It had no free oxygen. It is believed 
that around this time first life forms appeared on Earth. Earth's initial life forms were bacteria, 
which could survive in the highly toxic atmosphere that existed during this time. Around 3.5 
Ga, oxygen- forming photosynthesis began to occur. Once oxygen began to build up around 2.5 
Ga, it made way for the emergence of more complex life as we know it today. About 530 Ma, 
'Cambrian Explosion' happened where life exploded developing almost all of the major groups 
of plants and animals in a relatively short time. The first modern human species emerged only 
about 2.2 Ma. Image source: http://en.wikipedia.org/wiki/History_of_Earth, 
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Figure 1.2: Oxygenation of Earth's atmosphere. Red and green lines plot the range of 
the estimates of atmospheric oxygen, time is measured in billions of years ago (Ga). Initially 
atmosphere was devoid of any free oxygen (Stage 1: 3.85 - 2.45 Ga). Though photosynthetic 
bacteria were producing oxygen, it was getting captured by organic matter and dissolved iron 
(and other reduced material) chemically. As these started to saturate, around 2.45 Ga, the 'great 
oxygenation event' occurred. Oxygen was now released as free molecule and was absorbed in 
oceans and seabed rock (Stage 2: 2.45 - 1.85 Ga). Later, oxygen started to gas out, but got 
absorbed by land surfaces and formation of the ozone layer (Stage 3: 1.85 - 0.85 Ga). Around 
this period the first multi-cellular organisms started to appear. As the oxygen sinks got filled, 
the gas started to accumulate in the atmosphere (Stages 4 and 5: 0.85 Ga - present). It is believe 
that high levels of oxygen led to the 'Cambrian explosion' (which happened at the beginning of 



Stage 5). Image source: http://en.wikipedia.org/wiki/Geological_history_of_oxygen 



1.2. Life 

and large molecules enclosed in a membrane. The membrane provides the cell a 
spatial form and separates the interior from the environment. The membrane con- 
tains several proteins which sense external signals, and allow a selective exchange 
of material between cell and the environment. The internal machinery of the cell 
is complex, involving metabolism, regulation and signaling. 

A cell converts simple food molecules absorbed from the environment (such as 
glucose, fructose, etc.) into other 'building-block' molecules (such as amino acids, 
nucleotides, hpids, etc.) and produces energy, which is essential for the cell to grow 
and function. This process is called 'metabolism'. Metabolism only involves the 
chemical conversion of small molecules called metabolites, but is dependent upon 
the availability of a set of 'large' molecules called enzymes (a class of proteins 
that catalyze metabolic reactions). Enzymes govern which metabolic processes 
happen inside a cell. The instructions to produce these (and other) proteins are 
contained in the DNA of the cell. When needed this information is read and the 
corresponding proteins produced via a two step process called gene expression. 
In the first step, called 'transcription', a machine called RNA polymerase reads a 
gene on a DNA molecule and assembles nucleotides in right linear order to pro- 
duce the corresponding single-stranded messenger RNA (mRNA) molecule (Fig. 



1.3a). Then, in the processe called 'translation', another machine called ribosome 



reads the mRNA molecule and assembles amino acids in the right linear order to 



produce the corresponding protein molecule (Fig. 1.3b). To be able to produce 
these macromolecules, RNA or the proteins, the cell needs the small building- 
block molecules that are the end product of metabolism - the nucleotides and the 
amino acids (which in the first place require these macromolecules for their own 
production) |29j. It also needs the machines, RNA polymerase and ribosomes that 
are made up of a number of macromolecules. 

The genetic code stored in DNA is "interpreted" during gene expression, and 
the properties of the expression (proteins, RNAs produced as result of gene ex- 
pression) give rise to the organism's phenotype. This process also highlights the 
information flow inside the cell. In 1958, Francis Crick pointed out that the infor- 
mation flow can happen only from nucleic acids to proteins and such information 



cannot be transferred back from protein to either protein or nucleic acid 30,31 
In other words a protein sequence can not be used to create a DNA in the cell. 
In the usual case the information flows from DNA — )■ RNA — )■ proteins, and never 
from protein — )■ protein or protein — )■ DNA or RNA. 

With the kind of elaborate machinery described above, life has to maintain a 
balance between various participating subcomponents. This requires regulation 
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Figure 1.3: Gene expression and production of proteins, (a) The first step in gene 
expression is transcription. RNA polymerase (RNAP), an enzyme, binds to the DNA molecule 
and produces RNA using DNA genes as template. RNA polymerase first unzips the DNA 
double-strand by breaking the hydrogen bonds between complementary nucleotides and then 
adds complementary RNA nucleotides paired with DNA bases. RNA nucleotides join together 
by forming the sugar-phosphate backbone with the help from RNA polymerase. As the hydrogen 
bonds of the untwisted RNA-I-DNA helix break the newly synthesized RNA strand start to 
separate from DNA. (b) Translation of mRNA and the synthesis of proteins by a ribosome. 
Ribosomes are made up of RNAs and proteins called ribonucleoproteins and are divided into 
two subunits. The smaller subunit binds to the mRNA, while the larger subunit binds to the 
tRNA and the amino acids. Each triplet of nucleotides on mRNA, called codon, corresponds to 
a binding site complementary to an anticodon triplet in tRNA. tRNAs with the same anticodon 
sequence always carry identical type of amino acid. The ribosome helps complementary tRNA 
to bind to mRNA and takes the amino acid from each tRNA, chaining them together to form 
protein. Images source: jhttp://en.wikipedia.org/wiki/Gene_expression| 
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and synchronization of all the processes for life to be able to perform its tasks 
efficiently. This adds another level of complexity to the way life functions. Based 
on the environmental conditions (and some internal cues, such as cell division, 
etc.) the cells may express different sets of genes at different times. The ability to 
differentially express genes based on different environmental conditions gives cell 
the ability to adapt to different environments. The process of regulation also allows 
the same genome to express different phenotypes, as is the case for multicellular 
organisms where the same stem cell can differentiate into different adult cell types. 
Regulation involves both small and large molecules and it happens at all levels of 
gene expression. 

A similarly large and complex biochemical machinery is needed to produce a 
copy of DNA at the time of cell division. The process, which is called DNA repli- 
cation, involves several macromolecules, such as, DNA polymerase, DNA ligase, 
helicase, etc. In this process, DNA is used as a template on which these complex 
assemblies of proteins work to produce a copy of the DNA. Errors in copying can 
produce a non identical copy of DNA which results in two genetically different 
offsprings, producing the variation necessary for the evolution of life. 

1.2.2 Diversity and universality of life 

Life appears in various forms and sizes on Earth. The smallest cell measures a few 
hundred nanometers and the largest organism measures about tens of meters. Life 
can exist in the harshest of habitats ranging from extremely hot to extremely cold 
places. The diversity of life can be estimated from the fact that about 10^ — 10^ 



different eukaryotic species inhabit Earth 32 , 33 . A recent study put this number 
at 8.7 ± L3 million |^34j. Of this only about 10% are known to us. In addition, 
it is estimated that there are about 4 — 6 x 10'^° prokaryotic cells that live on 



Earth 35 . One may imagine that such a diversity of organisms would require 
different mechanisms to operate. Though the specific pathways differ amongst 
different organisms, the biochemical machinery involved in encoding, transcription 
and translation is universal. It is believed that all life forms diverged from a 
common ancestor, called the Last Universal Common Ancestor (LUCA). Based 
on similarity of the genes, proteins, and metabolism, the tree of life has been 



constructed rooted on a common ancestor 36-39 
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1.2.3 Minimum complexity of life 

Mycoplasma genitalium, a parasitic bacterium which hves in the primate genital 
and respiratory tracts, is the smallest known organism capable of independent 
growth and reproduction. The genome of M. genitalium consists of 521 genes of 



which 482 are protein encoding genes 40 . In comparison most bacterial genomes 



typically encode for more than 2000 different proteins. E. coli, the best studied 
bacterium, has about 4500 genes. The proteins produced from these genes are 
involved in metabolism, replication, as well as building and maintenance of struc- 
ture. A theoretical study based on comparison of the genome of M. genitalium 
with that of Haemophilus influenzae suggested 256 genes that were close to the 
minimal set necessary and sufficient to sustain the existence of a modern-type 



cell 41 . It has been estimated that a minimal bacterium will require about 206 



protein coding genes based on an analysis of several free-living and endosymbiotic 



bacterial genomes 42 . It is worth noting that according to this estimate even the 



simplest cells require over 200 proteins to function. 

1.3 Origin of Life: The chicken or the egg? 

"// there has been a first man he must have been born without father or 
mother which is repugnant to nature. For there could not have been a 
first egg to give a beginning to birds, or there should have been a first 
bird which gave a beginning to eggs; for a bird comes from an egg." 

- Aristotle (384 - 322 BC) 

The complexity of life has perplexed and enthralled us, more so the enigma of its 



origin 43 - 52 ; 53 - 67 . It is difficult to imagine how an organization, as intricate as 
seen in life, could self-organize on a primordial Earth when none existed. We now 
review some experiments that have provided a perspective on what was possible 
on the prebiotic Earth. 

1.3.1 Miller's experiment and other related works: The spon- 
taneous production of certain building blocks of life 

Miller's experiment was the first experiment to show that amino acids can be 



produced spontaneously in simulated primordial Earth conditions [68] (see Fig. 1.4 



for the description of the experiment). Miller simulated an early Earth atmosphere 
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that contained methane, ammonia, hydrogen gas, and water vapor. It has been 
successfully repeated by others who have also considered other possible primordial 



environmental conditions 69-73 . There have been other experiments that have 



demonstrated production of purines and pyrimidine ribonucleotides 74 - 77 . 

These experiments are based on the studies that suggest a reducing or partially 
reducing atmosphere existed on the early Earth 10)[78 80 . Recent findings suggest 



that the primordial Earth's atmosphere may not be reducing as suggested earlier 



81 . A reducing environment may have existed locally, such as in the presence of 



iron or other metals in early oceans which could reduce nitrites to ammonia 82 83 



or around deep sea hydrothermal vents or volcanic discharges 84-89 allowing a 
prebiotic chemistry to initiate. 

Collectively these experiments indicate that several of the basic building-block 
molecules, such as amino acids, other molecules and possibly also nucleotides could 
be synthesized on the prebiotic Earth by natural processes. 

1.3.2 What came first? 

The above experiments as well as other work on prebiotic chemistry has led to 
several hypotheses about what were the first chemical steps towards life on Earth. 
Broadly these hypotheses have been classified into three categories: 'metabolism 
first', 'replication first', or 'membranes first'. These three scenarios are not neces- 
sarily mutually exclusive. We briefly discuss the three scenarios below. 

Metabolism first: In this hypothesis it is argued that a system of chemical reac- 
tions producing small organic molecules, metabolites, would have appeared 



first 46,60 66,85 90 97 . One proposed candidate for the most primitive set 
of chemical reactions is the reductive tricarboxylic acid cycle (rTCA cycle). 
The TCA cycle is a universal structure found in the metabolic networks 
of most organisms and is the starting point in these networks for chemi- 
cal pathways to synthesize all the building block molecules - amino acids, 
nucleotides and lipids. The pathways leading to the synthesis of simple or- 
ganic molecules might be promoted by natural catalysts such as minerals and 
porous surfaces in rocks, or as an autocatalytic system. The idea is that the 
origin of life was triggered by the accumulation of simple organic molecules 
in thermodynamically favorable conditions. The simple molecules formed as 
result of the above process can combine in various ways to form more com- 
plex molecules, the system as a whole constituting a primitive 'metabolism'. 
The production of all the molecules involved in the primordial metabolism 
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Figure 1.4: Schematic of Miller's experiment. To simulate early Earth conditions, Miller 
used water (HjO), methane (CH4), ammonia (NH3), and hydrogen (Hg) sealed inside a glass flask 
containing a pair of electrodes and connected in a loop to another flask containing liquid water. 
The liquid water was heated to induce evaporation and to add water vapour to the chemical 
mixture. The resulting gases were circulated around the apparatus, simulating the Earth's 
atmosphere. Sparks were fired between the electrodes to simulate lightning. The flask with 
heated water represents water on the Earth's surface and the recycled water vapour simulates 
water evaporating from lakes and seas, before going into the atmosphere and condensing into 
rain. The atmosphere was cooled again so that the water could condense and trickle back into 
the first fiask in a continuous cycle. After one week of continuous operation. Miller observed a 
dark brown scum had collected in the lower fiask. It was found to contain several types of amino 
acids, including glycine and alanine, together with sugars, tars, and various other unidentified 
organic chemicals. Image source: ihttp: //en. wikipedia.org/wiki/Miller-Urey_experimenti 
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is envisaged as a self sustaining system, as it uses as input simple inorganic 
molecules readily available on the Earth and incrementally builds more com- 
plex molecules using catalysis. Eventually, simple peptides will be formed 
and would lead to better catalysts; as well as nucleic acids that would form 
the templates for information storage. 

Replication first: One of the fundamental properties of cells is to replicate. The 
replication of a whole cell is preceded by the replication of a single molecule, 
the DNA molecule, which carries information about all the genes. This in- 
formation is central to how cell works and is transmitted from generation 
to generation through the above molecular replication. In the replication 
first scenario, it is believed that first the molecules capable of replication 
would have arisen. RNA is mooted as the candidate molecule for its ability 
to store, transmit, and duplicate information (like DNA) and act as cata- 
lyst for various chemical processes (ribozyme). It is believed that RNA was 
once capable of supporting independent life forms before giving rise to the 
DNA-RNA-protein world of today. This is called the 'RNA world' hypoth- 
esis. DNA is thought to have taken over the role of data storage due to its 
increased stability at a later stage, while proteins, through a greater variety 



of amino acids, replaced RNA's role in specialized catalysis 98 108 . The 
RNA world hypothesis suggests that RNA in modern cells is an evolutionary 
remnant of the RNA world that preceded it. 

Membrane first: In the membrane first scenario, it is hypothesized that phys- 
ical enclosures were formed first. In water lipid molecules are known to 
spontaneously form micelles or vesicles. These vesicles allow selective in- 
teraction with the environment and can provide an isolated space for other 
systems to grow. The idea of lipid vesicles was first proposed by Oparin 



43 . He proposed that coacervates, which are tiny spherical droplets of 
assorted organic molecules (specifically, lipid molecules) held together by 
hydrophobic forces from a surrounding liquid, would form first and followed 
by cell-like structure. A variant of this scenario is popularly called the 'lipid 



world' 109-113 . These vesicles can grow by the addition of monomers 



and divide to produce two vesicles 114,115 . Additionally these vesicles 



can provide semi-permeable membranes for small molecules to pass through 



and enable chemical reactions to fiourish inside the vesicle 116 118 . As 
an alternate, proteinoids (protein-like molecules formed by inorganic poly- 
merization of amino acids) have been proposed as the precursors to first 
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living cells. When present in certain concentrations in aqueous solutions, 
proteinoids form small structures called microspheres or protocells capable 



of growth and division 119 120 



None of these theories is self-sufficient and there are several issues that need to 



be resolved to have a comprehensive scenario for the origin of life 60 , 62 , 121 , 122 



A common theme that these hypotheses are based upon is that a self sustaining 
system arises first, providing the right starting point for more complex organiza- 
tion to arise. It could be a metabolism, which gave birth to protein world and 
subsequently RNA/DNA world; or beginning with an RNA world, that later gave 
rise to protein and DNA life; or beginning with a lipid world and later giving rise 
to metabolism/replicating molecules. There are models that discuss co-evolution 



of more than one subsystem in prebiotic scenarios 116, 123 128 . 



1.4 Motivation 

Instead of asking specific historical questions about whether the TCA cycle, or 
RNA molecules, or lipid enclosures came first, one can posit some general questions 
at a higher level of abstraction. These questions are also motivated by empirical 
facts about living systems. Our work is motivated by three such observations 
about the structure and organization of the biochemical machinery of life. 

1. The existence of a chemical organization. As described earlier a cell 
contains molecules with specific properties carrying out specific carefully 
orchestrated processes. Without these, it will not have the capability to 
uptake food sources from the environment, convert them to building blocks 
of life through chemical processes, grow, replicated and evolve. Life is an 
'organized' chemical system. Each of the process is well regulated and syn- 
chronized with other processes. The existence of such an organization is in 
itself interesting, but it also poses a question: how did such an organization 
arise in the first place? 

2. The question of a sparse biochemistry. The set of molecules that 
participate in biochemical pathways is a very small subset of all organic 

The Kyoto Encyclopedia of Genes and Genomes 



molecules 93,129 130 



(KEGG) database lists 16,311 metabolites and small molecules in all char- 
acterized organisms (1,761 organisms) participating in 8,840 biochemical re- 



actions 131, 132 . In a recent analysis of metabolism of Aquifex aeolicus, a 
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reductive chemoautotroph and the smallest freely living bacteria, only 125 



compounds were found in the core metabolism 129 . In comparison the 



number of small molecules weighing less than 300 Da has been estimated to 



be about 13.9 millon 133 and all molecules of up to 11 atoms of C, N, O, 



and F possible under consideration of simple valency, chemical stability, and 



synthetic feasibility rules were found to be 26.4 million in number 134 . The 
fact that biochemistry uses a very sparse subset of molecules point to some 
underlying pruning principles. The primitive biochemistry need not have 
been as sparse as the modern biochemistry (selection has operated for long), 
but would have seeded the selection that exists now. We wish to understand 
at the level of dynamics, the selection rules that could have been operative 
in selecting such a sparse set of molecules. Could the chosen subset have 
some special properties that enable it to spontaneously arise in prebiotic 
conditions? 

Production of large molecules need large molecules: A chicken- 
and-egg problem in chemistry. Large molecules, such as RNA and pro- 
tein molecules, containing about a hundred to several thousand monomers. 



are essential in the functioning of life. As discussed in the Section 1.2.1 
these molecules are responsible for the normal functioning of the cell. These 
molecules are produced inside a cell using machinery which is itself composed 
of such large molecules. The RNA polymerase reads the genes on DNA 
molecules and produces the corresponding messenger RNA molecules. The 
ribosome reads the messenger RNA molecule and produces the correspond- 
ing protein molecules. These machines, RNA polymerase and ribosome, are 
composed of macromolecules such as proteins and RNA molecules that are 
produced by the same process. When cells produce daughter cells, the latter 
are already endowed with these macromolecular assemblies at birth, from 
which they synthesize other molecules. Nowhere in the living world is there 
a natural process we know of that produces macromolecules and that does 
not itself use macromolecules. Hence the puzzle: how did the large molecules 
first arise? There has been much experimental work to explore mechanisms 
that could enhance the concentrations of monomers and synthesize long poly- 
mers 55 , 105 , 135 139 . While there is interesting progress, as yet there is no 



compelling scenario for the primordial origin of large molecules 140 141 



One can ask if there are generic processes that might plausibly happen on the 
prebiotic Earth that could provide an answer to the above questions. Can one 
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theoretically model such processes? Before we turn to models, we discuss one 
more empirically observed chemical property that seems to be fairly generic. 



1.4.1 Possible role of small molecule catalysis 

It has been observed that catalysis is a fairly ubiquitous property that arises in 
different kinds of molecules and even at small sizes. A catalyst is a molecule that 
speeds up the rate of a reaction without being consumed in the reaction. Enzymes 
are very powerful catalysts of metabolic reactions in a cell; they speed up the rate 
of the reactions they catalyze by factors ranging from 10^ to 10^^ 142 . They 



are also very specific catalysts, in that each enzyme catalyzes typically a single 
reaction only. Enzymes are very large protein molecules (containing from a few 
hundred to a few thousand amino acids) and were not available in the early stages 
of prebiotic chemical evolution. 

However, small organic molecules which could have been produced by natural 
prebiotic processes are also known to have catalytic activity which is weaker and 



less specific. These include organocatalysts 143 145 , peptides (small polymers 
of amino acids) (59, 146 149 , and RNA molecules 150 154| . Cofactors, which 
play an important role in catalyzing metabolic reactions (or their evolutionary 



predecessors) may also have had a role in prebiotic catalysis 1129 



1.4.2 Autocatalytic sets 

The molecules produced in the spontaneous reactions from the prebiotic precursors 
(produced in Miller type experiements) may have included weak catalysts for 
other reactions involving the same set of chemicals as reactants, thereby producing 
in turn a new set of molecules. In this respect, autocatalysis has been mooted 
as an important self-organizing property. In chemistry, a reaction is said to be 
autocatalytic if one of its products catalyzes the reaction, e.g., A + B — > C. An 
autocatalytic reaction has the consequence that the production of a small quantity 
of the product, C, results in a positive feedback that produces more of C. 

A set of chemical reactions can be said to be "collectively autocatalytic" if 
the products of the set mutually catalyze all the reactionq^ Autocatalytic sets 



(ACSs) were proposed by Eigen 155 , Kauffman 156 and Rossler 157 . An ACS, 
like a single autocatalytic reaction, has the property that the production of a 
small amount of the products of the ACS can, in principle, lead to the collective 



^A formal definition of autocatalytic sets relevant for our work is given later in Chapter |4J 



Section 4.1 
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production of an even greater amount of the same. Thus if an ACS is present 
in the chemistry this could lead to focusing of chemical resources preferentially 
into the products of the ACS. This could address the problem of why a sparse 
set of molecules has come to dominate biochemistry. Furthermore, an ACS is a 
cooperative organization, in that the products collectively reinforce each other's 
presence. It is conceivable that ACSs can therefore help us understand the coop- 
erative nature of biological organization. 

These properties have motivated many people to explore ACSs as a possible 
organization principle in the origin of life problem in variety of ways. ACSs have 
been used by many authors to study various aspects of self-organization, evolution 



and the origin of metabolism 45 , 126 ,158 166 , the origin of replication 167 169 



and the origin and dynamics of protocells 110, 116, 117, 127, 170 



1.4.3 Models of prebiotic autocatalytic organization 

We now discuss some models of autocatalytic organization that have motivated 



the present work. A model proposed by Farmer, Kauffman and Packard 158 



discusses a mechanism based on ACSs via which a selected set of molecules can 



dominate the chemistry (see Section 1.4.3.1 ). Another model of network evolution 



by Jain and Krishna demonstrates how a complex autocatalytic organization can 
arise and grow when none exists to begin with 163 164 171| (see Section 1.4.3.2). 
We discuss these two models below and also state their limitations which we 
address in this work. 



1.4.3.1 A model for autocatalytic chemical focussing 



The model was proposed by Farmer, Kauffman and Packard 158 and later dis 



cussed and analyzed in series of papers by Bagley et al 172 , Bagley 173 , and 



Bagley and Farmer 159 1. In the model they consider an artificial chemistry in 



which polymers could form by ligation of shorter polymers through spontaneous 
reactions as well as reactions catalyzed by other polymers in the chemistry. They 
considered a set of monomers {a,b,c, . . .} as the starting material, members of 
which could combine together to form polymers of different lengths. The polymers 
are represented as strings of the monomers (a, 6, c, . . .), such as, accaaabd, where 
the order in which the monomers combine differentiate a molecule. Furthermore 
they are considered to be oriented, so that aab and baa would represent different 

E 

polynomials. The polymers undergo reactions of the form A + B ^ C + H, where 
A, B, C and E are the polymers produced in the chemistry and H denotes water. 
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Bagley and Farmer analyzed the population dynamics of the molecular species 
in a homogeneous (well-stirred) environment and established some important prop- 
erties of autocatalytic self-organization. When the food set (monomers) were 
supplied at a fixed input rate and the chemistry contained an ACS they showed 
that in a suitable range of parameters the concentrations of the ACS molecules 
dominated over the rest of the molecules (the background), thereby focusing the 
chemical resources of the system into a small set of molecules comprising the ACS. 



Bagley, Farmer and Fontana 160 also considered a version of the model wherein 
new species could enter the system stochastically, resulting in an evolution of the 
autocatalytic organization. 

The identity of polymers considered by Bagley and Farmer depends on the 
order in which monomers combine. Thus the number of molecules in their chem- 
istry grows exponentially with the length of molecules (~ /", / is the number 
of distinct monomers, n is the length of the polymer). This prevents one from 
studying large chemistries. In the model presented in this thesis, we ignore the 
order in which molecules combine allowing us to simulate much larger chemistries. 

Though the Bagley-Farmer model was successful in demonstrating a mech- 
anism of autocatalytic focusing, it only discusses this in cases where the largest 
polymers in the ACSs have about 15-20 monomers. Bagley and Farmer did not sys- 
tematically investigate the problems that arise in generating much larger molecules 
in their chemistry. This problem was investigated by Ohtsuki and Nowak 169 in 
a much simpler model in which they showed that for a large molecule to catalyze 
pathways of its own production starting from small food molecules, it is required 
to have a large catalytic strength that grows exponentially with the length of the 
catalyst. However, they did not present a solution to this problem. 

1.4.3.2 A model of network evolution and emergence of organization 

Another motivation for our present model is the work of Jain and Krishna in which 



a simple model of catalytic network evolution was discussed 163, 164, 171 . 

In the model, they consider a prebiotic soup of chemical species of which some 
can catalyze the production of other species, forming a catalytic network. On a 
short time scale, they consider the population dynamics of the constituent chemical 
species. On a longer time scale evolutionary dynamics are considered in which one 
of the least populated species in the system is replaced with a new species. No 
qualitative change to the attractor happens by repeated replacement of species till 
a special structure, an autocatalytic set, arises in the network. An ACS is formed 
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in the network when two or more species mutually catalyze the production of each 
other. Nodes downstream to an ACS also do well populationally, these nodes form 
the periphery of the ACS. This causes nodes outside the ACS to be selected for 
replacement. Soon, the ACS grows to include the entire network. 

This model with its simplicity is able to demonstrate how a complex organiza- 
tion can build around an autocatalytic set over the course of network evolution. 
Importantly it also shows that this organization builds fairly quickly once an ACS 
appears in the system. One of the limitations of Jain-Krishna model is that it 
presupposes a buffered input for all the reactants required to produce the cata- 
lysts. One may envisage situations wherein a catalyst itself acts as reactant in 
some other reaction, causing its population to deplete. This in turn could effect 
the rates of the reaction in which this molecule act as a catalyst. 

One of the motivations of the model discussed in the thesis was to study the 
Jain-Krishna model with the above mentioned resource limitation. In the model 
discussed in this thesis, we show that this affects the dynamics. 

1.5 Thesis organization 

The subsequent chapters in this thesis are organized as follows: 

Chapter 2 introduces our mathematical model describing the population dynam- 
ics of molecules in a stylized but prebiotically plausible artificial chemistry. 
Larger molecules are produced in this chemistry by the coalescing of smaller 
ones, with the smallest molecules, the monomers, assumed to be buffered. 
Some of the molecules produced in the chemistry can also act as the cata- 
lysts for the reactions of the chemistry. They do so by enhancing the rate 
of the reaction in proportion to their concentration by a factor that denotes 
their catalytic strength. Using mass action kinetics we write the dynamical 
equations describing populations of molecules in this chemistry. This chap- 
ter discusses in detail the examples of chemistries that contain one and two 
monomers in the food set. 

Chapter 3 presents dynamical results from the model for the case of spontaneous 
chemistry, i.e., the chemistry in which none of the reactions is catalyzed. It 
shows that the steady state concentrations of the molecules fall exponentially 
with their length. It discusses the dependence of the steady state concentra- 
tions on the various parameters of the model. Analytical results in a region 
of phase space are also discussed. 
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Chapter 4 discusses the dynamics of the model when the chemistry contains 
some catalyzed reactions, and in particular autocatalytic sets. It defines 
autocatalytic sets in context of our model and discusses various dynamical 
features that arise in the system in the presence of an ACS in the chemistry. 
Using a specific example of an autocatalytic set we show that a sparse subset 
of molecules can dominate above the rest of the molecules in the chemistry 
(the background) under certain parameter regimes. We present an analytical 
understanding for this phenomena. We show how a large catalytic strength 
can counteract the effect of dissipation for the molecules produced in cat- 
alyzed reactions. We also show that ACSs can produce bistability in the 
population dynamics. This chapter also discusses the relation between the 
topology of ACSs, system parameters and the existence of bistability. The 
work described in this chapter extends the work of Bagley and Farmer on 
autocatalytic focusing. 

Chapter 5 articulates the problem of producing large catalysts using ACSs. For 
a class of ACSs, referred to as the 'extremal' ACSs, this chapter shows how 
the region of bistability changes with the length of the catalyst. It discusses 
that the catalytic strength required by a catalyst to dominate the chem- 
istry starting from an initial condition relevant to the origin of life problem, 
grows exponentially with the size of the catalyst. This generalizes the results 
of Ohtsuki and Nowak to a larger class of chemistries and emphasizes the 
difficulty in producing large molecules via the ACS mechanism. 

Chapter 6 introduces the mechanism of nested ACSs. It discusses in detail an 
example of two nested ACSs where the smaller ACS reinforces the large one. 
We discuss how the relative topologies of the two ACSs and the background 
chemistry affect the reinforcement received by the larger ACSs. This chapter 
discusses how the mechanism of nested ACSs can be used to overcome the 
problem of producing large molecules presented in Chapter 5. 

Chapter 7 describes two algorithms, based on the discussion in Chapter 6, that 
produce a hierarchy of nested ACSs. It demonstrates that chemistries that 
contain a structure such as nested-ACSs can produce large molecules in 
prebiotically plausible conditions. We show an example where a molecule 
of size 441 {i.e., a molecule containing 441 units of monomer) dominates 
the chemistry starting from a initial condition where only monomers were 
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present and at a catalytic strength that only grows as a power of length of 
the catalysts (as against exponentially needed otherwise). 

Chapter 8 is a discussion of the results. It discusses some prebiotic scenarios 
where our model could be applied. It also discusses limitations and caveats 
of the model and directions for further work. 

Appendix A. Though the model considers an infinite sized chemistry, it requires 
for numerical simulations to consider only a finite subset of the chemistry. 
This introduces an additional parameter N, the size of the largest molecule 
considered. This appendix presents the evidence for the A^- independence of 
results for large N. 

Appendix B. The model discussed in this thesis considers a boundary condition 
in which the food set molecules are considered to be buffered, i.e., their 
concentrations remain a constant over the course of dynamics. We discuss 
in this appendix another boundary condition in which the monomers are 
supplied at a fixed input rate. 

Appendix C discusses the special case of our model under which it reduces the 
model of Ohtsuki and Nowak. 

Appendix D discusses examples of catalyzed chemistries that do not contain 
ACSs and the origin of multistability for an ACS. 

Appendix E presents a discussion on how the topology of an ACS, especially 
the assignment of the catalysts to its reactions, affects the dynamics. 

Appendices F, G and H list the reactions with their respective catalysts for 
various examples of catalyzed chemistry for / = 1 and 2 containing a cascade 
of nested ACSs generated using Algorithm 4 and 5 (described in Chapter [?]). 

Appendix I describes various programs used to generate results reported in this 
thesis. The programs are available from the following link: 



http : //sites . google . com/site/varungiri/research/phd- thesis 
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Chapter 2 

A model of pre-biotic chemistry 



IN this chapter, we describe a model of pre-biotic artificial chemistry in which 
larger molecules are produced by coalescing of smaller ones via spontaneous 
and catalyzed reactions. In the following chapters, we use this model to study how 
large molecules, relevant for the origin of life, can arise in a pre-biotic scenario. 
We explore the model numerically and, in certain regimes, analytically. 

2.1 The model 

The model is specified by describing the set of molecular species, their reactions, 
and the dynamical rate equations for their population dynamics. A special set 
of molecules, the 'food set', denoted 5", consists of small molecular species, / in 
number, that are presumed to be abundantly present in a prebiotic niche. 

J={Ai,A2,...,A^} (2.1) 

A general molecule, say A, is represented as an /-tuple of non-negative integers: 
A = (oi, 02, . . . , a/), where ai is the number of monomers of type A; contained 
in A. The 'size' or 'length' n of the molecule is defined as the total number 
of monomers of all types in it: n = 'Ylii^'i- The food set molecules themselves 
are represented by the /-tuples Ai = (1, 0, 0, . . . , 0), A2 = (0, 1, 0, 0, . . . , 0), . . ., 
A; = (0,0,...,0,1). 

The identity of a molecule in the model is completely determined by the number 
of monomers of each type contained in the molecule; the order in which they appear 
is irrelevant. Thus the combinatorial diversity of distinct compounds containing 
a total of n monomers (of all types) grows only as a power of n (~ n^'^) instead 
of exponentially (~ /" for strings) if the order had mattered. This simplification 
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helps in picturizing the chemistry and significantly reducing the computational 
power needed to explore large values of n. 

Starting from the food set, larger molecules in this chemistry are made through 
a series of ligation and cleavage reactions of the type B + C ^ A, where A = 
(ai, 02, ... , a/), B = (61, 62, • • • , hf) and C = (ci, C2, . . . , c/), with ai = bi + ci. If 
xa denotes the concentration of the molecule A, using mass action kinetics, the 
rate equations are given by 

XA = 0, if A G 3^, (2.2a) 

= ^ fBC - ^ t-AB - 2waa - 0Aa;A, otherwise. (2.2b) 

{B,C)eQA B,B^A 

It is assumed that in a well stirred prebiotic region the concentration of the mem- 
bers of the set 5" are constant (buffered)Q 

vxY = k^Y^xXy - k^Y^z (2.3) 

is the net forward flux of the reaction pair X + Y ^ Z with forward (ligation) 
rate constant fc^Y ^^^ reverse (cleavage) rate constant k^y, 0a is the loss rate of 
A, and Qa represents the set of unordered pairs of molecules which can combine 
together to form A (Qa = {(B, C) : 6/ + q = a, V / = 1, 2, . . . , /}). 



The two terms in the first sum, in Eq. (2.2b), represent the formation (cleav- 
age) of A from (into) smaller molecules. The second sum and the following 
term represent the cleavage (formation) of larger molecules via reactions that 
produce (consume) A. The stoichiometric factor of 2 before f aa arises because 
two molecules of A are involved in the corresponding reaction pair. 

The set of parameters k^y, k^y that are non-zero define the set of possible 
reactions; collectively they define the 'spontaneous chemistry' ('spontaneous' in 
the sense that the reactions are possible even in the absence of catalysts). A pair 
of ligation and cleavage reactions can be excluded from the chemistry by setting 
both k^y and k^y to zero. The scheme permits chemistries in which some reac- 
tions proceed in only one direction (ligation or cleavage) by setting only one of k^y 
and fcxY ^*^ zero. However, we will primarily be interested in a chemistry in which 
each reaction is reversible. The existence of the cleavage reactions makes it more 



^In a prebiotic niche, such as a hydrothermal vent, even as the smah molecules are used 
up in chemical reactions to form other products, natural geochemical processes replenish these 
molecules keeping their concentrations at a significant level. For modeling purposes we assume 
that their concentrations are constant in the region in question. 
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difficult for tlie long molecules to survive; thus it is more significant to demon- 
strate the appearance of long molecules in a model in which cleavage reactions are 
permitted than in one where only the forward (ligation) reactions are. 

2.1.1 Nomenclature 

We classify spontaneous chemistries based on their connectivity and heterogeneity 
of parameters. 

'Reversible' chemistry A chemistry is called 'reversible' if each allowed reaction 
is reversible, i.e., k^ ^ ^^ k^ ^ 0. 

'Connected' chemistry A chemistry is 'connected' if every molecule can be 
produced from the food set in some pathway consisting of a sequence of 
allowed reactions. A chemistry is 'fully connected' if all possible ligation 
and cleavage reactions are allowed: k^y 7^ ^''^^ ^xy t^ ^'^^ ^11 X,Y. A 
fully connected chemistry is a reversible chemistry by definition. A chemistry 
is called 'sparse' if it contains only a small fraction of all possible reactions. 
A sparse chemistry is not necessarily connected. 

'Homogeneous' chemistry A chemistry is called 'homogeneous' if all the non- 
zero rate constants, loss rates, and concentrations of the food set molecules 
are independent of the species labels: /c^y ¥" =^ ^xy — ^/' ^xy ¥" =^ 
^XY — ^r independent of X and Y, 0x = for all X, and, x^i = xp^^ = 

. . . = XAf = ^• 

2.1.2 Inclusion of catalysts 

We consider a simple scheme for inclusion of catalysts. We assume that a molecule 
enhances the rate of a reaction that it catalyzes in proportion to its own concen- 
tration. Thus, if C is a catalyst of the reaction pair X + Y ^ Z, then the rate 
constants of this reaction pair, k^y ^''^^ ^xy-> ^^^ replaced 

^XY -^ ^xy(1 + ^^^xc) and (2.4a) 

k^y^k^y{l + Kl^xc), (2.4b) 

where k^ is the 'catalytic strength' of the catalyst for this reaction pair. The 
first term in the bracket, unity, represents the spontaneous reaction rate (which is 
present irrespective of whether the reaction is catalyzed or not), and the second 
term n^ xq represents the enhancement of the reaction rate due to the catalyst. 
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Note that in this scheme a catalyst enhances both the forward and reverse reaction 
rates by the same factor. If a reaction has muhiple catalysts, Kq^xc is replaced 
by ^(-. Kq^xq, where the sum runs over all catalysts C of the reaction in question. 
Typically, only a small subset of the spontaneous reactions will be catalyzed. The 
set of catalyzed reactions together with the catalysts and their catalytic strengths 
will be referred to as the 'catalyzed chemistry'. 

We refer to a catalyzed chemistry as 'homogeneous' if the catalytic strength 
of every catalyst for each reaction that it catalyzes is the same, i.e., if Kq^ ^ 0, 
then Kq^ = K. 



Eqs. (2.2) in conjunction with (2.3) and (2.4) define a deterministic non-linear 
dynamical system on a complex network. The network structure is specified by 
the nonzero constants /cxy; ^xy? ^^^1 Kq^ . 

2.1.3 Numerical simulations 

While the chemistry under consideration is infinite, numerical simulations were 
done by choosing a finite number A^ for the size of the largest molecule in the 
simulation. For a finite chemistry we exclude all the ligation reactions (and their 
reverse) that produce molecules of size larger than A^. In principle this introduces 
another parameter, A^, an artifact of the simulation. However, we find that most 
properties of physical interest become independent of A^ when A^ is sufficiently 
large (evidence for this is presented later in Appendix |A]) . 

Our numerical work, which comprised of solving coupled ordinary differential 
equations, was mostly done using the CVODE solver library of the SUNDIALS 
(Suite of Nonlinear and Differential/Algebraic Equation Solvers) package v.2.3.(r] 
[l74] , and, for smaller A^ values, using XPPAUT v.6.110 (l75J. Steady states 
obtained were verified using numerical root finders in Octavqj and Mathematica 



176. 



2.2 Chemistries with / = 1 

The simplest version of the model (/ = 1) contains only a single monomer species. 
Here, instead of using a '1-tuple', we represent the molecule of size n by the 
notation A(n) for clarity. The concentration of the monomer, A(l), is represented 



^https : //computation. llnl .gov/casc/sundials/main.html 
^http: //www. math. pitt .edu/~bard/xpp/xpp.html| 



http : //www . gnu . org/sof tware/octave 
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2.2. Chemistries with / = 1 

by Xi{= A) and of the other molecules, A(2), A(3), . . . (dimers, trimers, etc.), is 



given by X2,Xs, . . . respectively. Following Eqs. (2.2), the rate equations for the 
system are given by xi = 0, and, for ra = 2, 3, . . ., 

oo 
Xn / ^ f^ij / ^ V^fi ^Vnn Yn-^n K'^'^J 

oo 

= / ^ [l^ijXiXj — Kj^jXri) + / ^ ['^in-^i+n ~ n^^XiXn) + ^(.fc^n-^Sn ~ '^nn^nl 
i<j,i+j=n i=l,i^n 

- (pnXn, (2.6) 

where Vij = {kfjXiXj — k^Xi+j) is the net forward flux of the reaction pair A(i) + 
A(j) ^ A(z + j), k[j and kj^ being the forward and reverse rate constants, respec- 
tively. Explicitly, 

X2 ~~ [rh^-iX\Xi '^11*^2/ l'^l9'^l*^2 ^12*^3/ \ l^'^l '^22'^4/ 

- {k^2^3X2 - k§2X^) - ... - 02a;2 (2.7a) 
is = (fcf2a;ia;2 - k^^^^) - (A^fgXiXs - fcf3X4) - (fcf3a;2a;3 - ^^33:5) 

- 2 (^^3X3X3 - A;^3X6) - {kl^XiXz - k^^xj) - ... - ^3X3 (2.7b) 

X4 = (fcf3a;iX3 - k^^Xi) + (^^2X2X2 - /C^X4) - (^{4X1X4 - fc^Xs) 

(^fc24X2X4 K24XQJ (^"^343^33^4 "^343'7J ^ (^^44X4 ^443^8^ 

- (^^43^53:4 - ^£3^9) - • • • - 043^4 (2.7c) 



For a finite chemistry, in simulating Eq. (2.6), all terms corresponding to 
reactions in which any molecule larger than A(A^) is produced or consumed are 
omitted. This results in A^ molecules and A^ — 1 coupled ordinary differential 
equations, given by: 

{N-n) 

Xn / ^ yf^ijXiXj Kj^jXnj / ^ y'^in'^i'^i^ "'in'^ *+'!■ / 

i<j,i+j=n i=l,i^n 

~ 2 (fc„„X„ — fi;„„X2nj — (pnXn (2-8) 

for n = 2,3, ... ,N (the third bracket is absent for n > N/2). Note that 00 is 
replaced by {N—n) in the second summation so that the largest molecule produced 



is of length N. For the case of homogeneous chemistry Eq. (2.8) becomes 

Xfi y ^ yrijX{Xj riyXyij ^ ^ yKjX^X^ KjXij^^j 

i<j,i+j=n j=l,j^n 

- 2 [kfxl - Kx2n) - (pXn. (2.9) 



25 



Chapter 2. A model of pre-biotic chemistry 

2.2.1 An example with / = 1, A^ = 6 

The fully connected spontaneous chemistry has 9 reaction pairs of forward and 
reverse reactions: 
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From Eq. (2.8), the 5 equations defining the dynamics are: 



^2 — \ 1 1 "^1 '^11^2/ \'^19'^1^2 12*^3/ \ 92^2 '^22*^4/ 

(^^3X2X3 - ^^3X5) - {k24X2X4 - fc^Xe) - (f)2X2, 
X3 ={k(2XiX2 - ^^22^3) - (k^s^iXs - k^^X^) - {k23X2Xs - k^^x^)- 

'^{kssxl - fc^gXe) - 03a;3, 

X4 =("^133^13^3 ~ "^13-^4) + ("^22-^2 ~ '''22'^4j ~ {ki^XiX4 — k-^^^X^) — 

(/C24X2X4 — n^24'^6) — 4*4X4, 
X5 =(fcf4XiX4 - fcf4X5) + (fcf3X2X3 - /cf3X5) - (fcfgXiXs - k^^Xg) - (P5X5, 

xe ={k(r^XiX5 - fcfsXg) + (^^4X2X4 - k^^xe) + (k^^xj - fcfgXe) - (pe^e- 



(2.10a) 

(2.10b) 

(2.10c) 
(2.10d) 
(2.10e) 



26 



2.3. Chemistries with / = 2 

For a homogeneous chemistry, k[j = kf, k^ = kr and (pi = 0, for all i,j, the 
equations become 

X2 =kfxl — krX2 — kf X2{Xi + 2^2 + X3 + X4) + kr{x3 + 2X4 + X5 + Xq) — 0^2 

(2.11a) 
X3 =kfXiX2 — krX3 — kfX^^Xi + X2 + 2x3) + kr{x4^ + X5 + 2xq) — (px^ (2.11b) 

^4 =kf{XiX3 + x\) — 2krX4 — kfXi{Xi + X2) + kr{x^ + Xq) — 0X4 (2.11c) 

is =kf{xiX4 + X2X3) — 2krX5 — kfX^{xi) + K^xq) — 0X5 (2. lid) 

Xq =kf{xiX5 + X2X4 + X3) — SkrXe — 0X6 (2. lie) 

2.2.1.1 An example of inclusion of catalyst 

Say reaction pairs Rl and R4 of the previous example are catalyzed by A(4) with 
catalytic strengths kI^ and /t|^ respectively. Then the equations for X2 and X4 get 
modified to 

X2 =(1 + Kl^X4){k[^xl - k^^X2) - {k(2XlX2 - ^f^Xs) 

- 2(1 + Kfx4){k^2A - ^222^4) - (A;f3X2X3 - k^^x^) 

- (/i;^X2X4 - /c^xe) - 023^2, (2.12a) 

- (/i;^X2X4 — /i;^X6) — 04X4. (2.12b) 

The equations for X3, X5 and xq remain as before. If the catalyzed chemistry 
is homogeneous, then k\^ = k^^ = k. 



2.3 Chemistries with / = 2 

For a / = 2 chemistry, 5" = {Ai, A2}, a general molecule is given by A = (ai, 02), 
and the monomers are given by the 2-tuples (1,0) and (0,1). There are {n + 1) 
molecules of length n, given by the 2-tuples: (n, 0), (n — 1, 1), . . . , (1, n — 1), (0, n). 
A finite chemistry, has a total of ^ ^ '!^ ^ ' — 1 molecules (including monomers), 

and hence -^ — 3 rate equations. 

Since the number of species goes as A^^/2 and the number of reactions as ~ A^*^ 
for a chemistry with / = 2, computational limitations require us to work with a 
smaller A^ than for / = 1. Qualitative conclusions nevertheless appear to be A^ 
independent. 
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Chapter 2. A model of pre-biotic chemistry 

2.3.1 An example with f = 2,N = 3 

The fully connected spontaneous chemistry has 9 reaction pairs which we label as 
R1,R2 R9. 
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Using Eqs. (2.2 ) suitably modified for a finite chemistry, we can write the following 



rate equations: 



i(0,2) = ^^Rl - ^^R4 - ^^R6 - 0(0,2)3^(0,2) 
i(l,l) = ^R2 - ^^R5 - ^^R8 - 0(l,l)a;(l,l) 
i(2,0) = ^R3 - Vk7 - t;R9 - 0(2,0)3^(2,0) 

i(o,3) = 'yR4 - 0(o,3)a:;(o,3) 

i(l,2) = ^^R5 + ^R6 - 0(l,2)a;(l,2) 
i(2,l) = fR7 + ^^R8 - 0(2,l)a;(2,l) 
2^(3,0) = VRg - 0(3,0)3^(3,0)- 



(2.13a) 
(2.13b) 
(2.13c) 
(2.13d) 
(2.13e) 
(2.13f) 
(2.13g) 



fRj is the net forward fiux of the reaction pair Hi. Eqs. (2.13) can be expanded 



to get the equations in terms of the x's. For example, Eq. (2.13b) becomes 



3^(1,1) = (^R2a;(o,i)a;(i,o) - ^R2a;(i,i)) - (A;R5a^(o,i)a;(i,i) - k^5^{i,2)) 

- (^R8a;(i,o)a;(i,i) - fcR8a;(2,i)) - 0(i,i)a;(i,i) (2.14) 

where, fc^j and k^^ are the forward and reverse rate constant of reaction Ri, re- 
spectively. 

When the catalysts are included, the reactions that involve catalysts are changed 



in the same way as in / = 1 case (see Section 2.2.1.1) 
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2.4. Rate equations in terms of dimensionless variables 

2.4 Rate equations in terms of dimensionless vari- 
ables 



The rate equations (Eqs. (2.2)) can be cast in dimensionless form by introduc- 
ing a concentration scale u and a time scale r. We discuss below the case of a 
homogeneous spontaneous chemistry with / = 1. 

We define dimensionless quantities «„ = x„/aj, t' = t/r, k'^ = kjur, k[. = krT 
and 0' = 0r. Then 

. _ dXn _ d{UnUj) _ udUn , . 

'"''' dt ~ d{t'T) ~ T df ^ ' 

and Eq. (2.6) becomes 

ujdun sr^ f k'f k' \ 

- y~] { —{uJUi){uJUn) '-{uUi+ri)] 

- 2 (^{uUnf - '^{uJU2n)] - "(c^^n) (2.16) 

\UJT T ) ^ 

, oo 

— — = / ^ (^kj-UiUj — kj.Un) — / ^ [k^UiUn — k^Ui^n) 
i<j,i+j=n i=l,i^n 

- 2 {k'fUl^ - k'^U2n) - 0'Mn (2.17) 

Without loss of generality one can choose u = Xi = A and (whenever k^ 7^ 
0) r = 1/kr. Then the dimensionless concentration variables satisfy the same 
equations as before, but with A = kr = 1. There are now only two independent 
dimensionless parameters, k'r and 0'. The dependence on all 4 parameters can 
be recovered at the end by replacing Un by Xn/A, t' by tk^., k'^ by kfA/kr and 0' 
by (p/kr- One may also choose r = 1/0 (whenever 7^ 0). In that case the two 
independent dimensionless parameters will be k'jr and k'^. 

2.5 Comparison with existing models 

The model presented here is closely related to two existing models, one discussed 



by Bagley and Farmer 159, 172 and the other by Ohtsuki and Nowak 169 . The 



differences between the present model and the existing ones is discussed below. 
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Chapter 2. A model of pre-biotic chemistry 

Comparison with the model of Bagley and Farmer 

The main differences between the present model and that studied by Bagley and 



Farmer 159 172 



are: 



A simpler representation of molecules (we do not consider molecules as 
strings): Bagley and Farmer consider a polymeric chemistry in their model. 
The molecules produced in this chemistry are differentiated not only based 
upon their length but also the order in which the monomers appear in 
them, i.e., aab, aba, baa are considered to be three different molecules. We 
have only differentiated molecules based on their length and the number of 
monomers of each type they contain. The order in which the monomers 
appear is not considered. For a given maximum size of molecule N, this re- 
sults in a chemistry that contains much fewer molecules and reactions among 
them, making it possible to numerically investigate chemistries with much 
larger sized molecules. 

A simpler treatment of catalysis (we do not consider intermediate com- 
plexes): They consider a detailed mechanism of catalysis including the pro- 
duction of intermediate complexes. Their dynamical equations included 
equations for these intermediate complexes. We consider an approximate 
and much simpler treatment for inclusion of catalysis. 

We ignore the effects coming from small populations containing a discrete 
number of molecules: Bagley and Farmer discuss the effects that arise be- 
cause of the stochasticity in molecular populations. We consider only con- 
tinuous dynamics and ignore the effects arising out of finite number of 
molecules. 

Their results are at fixed input rate of food set molecules, whereas ours 
are primarily at buffered concentration of food set molecules (we also show 
results for a fixed input rate in Appendix |B|). 



We reproduce the main phenomenon of ACS dominance that Bagley and Farmer 
observed, but the relative simplicity of the present model allows us to explore other 
phenomena that they do not report about (this includes a multistability in the 
dynamics and the possibility of building large molecules through nested ACSs). 
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Comparison with the model of Ohtsuki and Nowak 



The main differences with the model of Ohtsuki and Nowak 169 



are 



• A much richer spontaneous chemistry of ligation reactions and the inclu- 
sion of reverse reactions: Ohtsuki and Nowak considered chemistries that 
only contained forward reactions. Also, larger molecules in their chemistry 
get produced only by successive addition of monomers. We have consid- 
ered chemistries that include reverse reactions and have multiple ways for 
production of molecules. This makes an analytical treatment of the system 
more difficult. 

• A much more general class of catalyzed chemistries, instead of a single cata- 
lyst (which allows us to talk of nested ACSs, in particular): They have only 
considered the cases wherein one molecule catalyses all the reactions in its 
production pathway; they do not discuss cases with more than one catalyst. 
We consider various topologies of catalyzed chemistries and discuss how the 
choice of topology affect dynamics. 

With a specific choice of parameters our / = 1 model reduces exactly to their 
'symmetric prelife' model with a catalyst. This is discussed in Appendix [Cj In 
spite of greater complexity we are able to numerically reproduce their main results 
in a much more general setting, and also provide an analytical understanding of 
the results based on certain approximations. 
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Chapter 3 

Steady state properties of the 
spontaneous chemistry 



THIS chapter discusses the behaviour of the model described in Chapter |2] 
when the chemistry only contains spontaneous reactions. We first discuss a 
spontaneous chemistry that is fully connected and homogeneous. This chemistry 
has 4 parameters, kf, k^, 4>, and A, the concentration of the monomers. The results 
for = are discussed first, for which we also present some analytical results. 
We then describe results for a general (p and show that the concentration of large 
molecules fall exponentially with their length. We have checked that introducing 
irreversible reactions and bringing in a small amount of heterogeneity does not 
change the conclusions. Some results for sparse and heterogeneous chemistries are 
also presented. 

In this chapter, and the remainder of this thesis, we discuss in detail results for 
the chemistries with / = 1. We also give some results for / = 2. The qualitative 
features of the model can be generalized for any values of /. 

3.1 Behaviour of the system in the limiting case of 

= 



3.1.1 Analytical solution 

For = 0, the following exact analytical solution for the steady state concentra- 
tions exists for homogeneous and connected uncatalyzed chemistries with / = 1: 

x„ = aM^ . (3.1) 
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Chapter 3. Steady state properties of the spontaneous chemistry 



To see that this is a fixed point, note that when Eq. (3.1) holds, then Vij = 
kfXiXj — krXi^j = for all i,j = 1,2, . . .; hence the r.h.s. of Eq. (2.5) vanishes 



(at = 0). The same argument applies for a general /; for / = 2 the expression 
becomes, 



4 ni 7^712 j •'"/ 



(ni+n2)-l 



(3.2) 



where A = X(i,o) and B = X(o,i) are the concentrations of the two monomers. 
In the case when the two monomers have same buffered concentrations (case of 
homogeneous chemistries), i.e., A = B, Eq. (3.2) reduces to Eq. (3.1) with 
n = ni + n2- 



3.1.2 Numerical results 

Starting from the initial condition in which all concentrations other than that of 
the food set molecules are zero, i.e., 



Xr 



0,n = 2,3,4, 



(3.3) 



(we refer to this as the 'standard' initial condition), in a fully connected and 
homogeneous spontaneous chemistry with / = 1 and = 0, the concentrations 



were found to increase monotonically and reach a steady state (Fig. 3.1a). The 



plot of steady state concentration x„ versus n is shown in Fig. 3.1b The graph of 
Xn versus n on a semi-log plot (inset of Fig 3.1b) was found to be a straight line, 
given by the expression 



Xr 



ck' 



ce 



-7(n) 



(3.4) 



where, c and A = e '^ are constants. A(0 = 0), determined by numerically fitting 



the slope, was found to be equal to kjA/kr, consistent with the Eq. (3.1). Hence, 
whenever kfA < kr, the steady state concentrations are exponentially damped. 
When kfA > kr, the steady state concentrations increase exponentially (for kfA > 
kr, the numerical integration does not converge as the steady state solution is 
numerically very large for large n). 

Similar results hold when more than one monomer is present in the system. 
For / = 2, the steady state concentration profile is shown in Fig. |3.2a[ when the 



concentration of the two monomers is the same and in Fig. 3.2b , when they have 
have different buffered concentrations. The results are found to be consistent with 



theEq. (3.2) 



Note that these results are independent of the size of the largest molecules 
in the chemistry, A^; x^'s are completely determined by the values of parameters 
kf, kr and A. 
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3.1. Behaviour of the system in the hmiting case of = 
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Figure 3.1: Concentrations in uncatalyzed chemistries with a single food source for 

4> — 0. (a) Evolution of concentrations with time for a chemistry with kf = 0.9, kr = A = 1. 
(b) Steady state concentration x„ as a function of molecule size n. The concentrations fall 
exponentially with length. Inset shows the same on a semi-log plot; the straight lines are 



evidence of exponential damping of a;„ (Eq. (3.4)), with A — 1,0.9,0.75,0.5 for the four cases, 
respectively. A is computed from the slope of a straight line fit. The value of parameters are 
indicated in the key. For simulation purposes, the size of the largest molecule was taken to be 
N = 100. 
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Figure 3.2: Concentrations in uncatalyzed chemistries with two food sources for 

= 0. The 3D plots show the steady state concentration a;„ of the molecules n — (rii, 71,2) as a 
function of ni and 712. The insets show a 'top view' of the (ni, 712) plane with a;„ indicated in a 
colour map on a logarithmic scale, (a) The two monomers have same buffered concentrations, 
2^(1,0) = ^{0,1) = 1 (homogeneous chemistry), kf = l,kr = 2, N = 40. (b) Two monomers have 
different concentrations, kf ~ kj. — l,a;n,o) = 0-75,a:(oi) — 0.5, N — 40. The results in two 



cases are found to be consistent with the Eq. (3.2) 
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3.2. Behaviour of the system for > 

3.2 Behaviour of the system for > 

When > we do not have an exact analytical solution. As in the case for = 0, 
for a fully connected and homogeneous spontaneous chemistry (/ = 1), starting 
from the standard initial condition (Eq. (|3.3)), the concentrations were found to 



increase monotonically and reach a steady state (Fig. 3.3a). The steady state 
values, which now depend upon the value of 0, were comparatively smaller. The 
plot of steady state concentration a;„ versus n for different values of is shown 



in Fig. 3.3b The semi-log plot of x„ versus n (inset of Fig 3.3b) was now found 
to be approximately a straight line for large n. The small molecules deviate from 
the straight line. A, calculated in this case by fitting the slope ignoring small 



molecules, decreases monotonically as increases (Fig. 3.3b). 



The ratio of steady state concentration of successive molecules, Xn+i/xn (= A 



as given by Eq. (3.4)), is not strictly a constant but depends upon n (Fig. 3.4). 
The ratio oscillates about a mean, and is larger when n + 1 is even (more so 
when it is a power of 2) than when n + 1 is odd; the molecules that are made up 
of two smaller molecules of the same size tend to have higher populations. The 



variation becomes quite small as n increases (insets of Fig. 3.4a). Thus Eq. (3.4) 
can still be used to approximate the solution that characterizes the steady state 
concentrations. Numerically, we find that A(0 > 0) drops to below 1 even when 
kfA/kr > 1. 

Similar results hold for / = 2; steady state concentration profiles for two 



different values of are shown in Fig. 3.5 'Diagonal entries' {rii = ^2) have 



higher concentrations in homogeneous chemistries because there are more reaction 
pathways to build molecules with equal numbers of both monomers than unequal, 
and the larger number of production pathways for the diagonal molecules are able 
to counteract the effect of dissipation more effectively. 

The values of A extracted from the slope of the straight line fit are found to be 
A^ independent for sufficiently large A^. A detailed discussion for A^-independence 
of this and other results is provided in the Appendix |Xj 

3.2.1 Dependence on parameters 

A is found to be a monotonically increasing function of kf and A, and a monoton- 
ically decreasing function of fc^ and 0. This corresponds to the intuition that an 
increased ligation rate favours large molecules and an increased cleavage or dissi- 
pation rate disfavours them. Recall the discussion on casting the rate equations 



in terms of dimensionless variables presented in Section 2.4 For homogeneous 
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Figure 3.3: Concentrations in uncatalyzed chemistries with a single food source. 

(a) Evolution of concentrations with time for a chemistry with kf = kr = A = (l) = l,N = 100. 
Dashed curves show the same when </> = (other parameters remaining the same). For small t 
the (f) = 1 curves are similar to = 0; as the concentrations become large the effect of finite (p 
starts to show, (b) Steady state concentration as a function of molecule size. Parameters take 
the same values as in (a) except that four values of (j) are shown, (f) — 0,0.1, 1, 10. Inset shows 
the same on a semi-log plot; A = 1, 0.77, 0.49, 0.16 for the four cases, respectively. A is computed 
from the slope of a straight line fit after ignoring the smaller molecules (up to n = 4 in this 
case) . 
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Figure 3.4: The ratio of steady state concentration of successive molecules, a;„+i/a;„, 
versus n. (a) The ratio x„+i/x„ is not strictly a constant but depends upon n. It oscillates 
about a mean, becoming larger for even values of n + 1 (powers of 2 show even larger variation). 
For larger values of n, the variation in ratio becomes small (see insets) and thus the ratio can be 
approximated by a constant. The effects arising because of the finitcncss of chemistry are also 
visible when n ^ N . Parameters take the same values as in Fig. 3.3a (b) The ratio x„+i/a;„ 



versus n is plotted for 5 different set of parameters (values are indicated in the key). 
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Figure 3.5: Steady state concentration profile in an uncatalyzed chemistry with / = 

2. The 3D plots show the concentration Xn of the molecule n — (ni, 712) as a function of rii and 
77,2 in the steady state, for an uncatalyzed chemistry with kf = kj. = 2^(1,0) = ^(0,1) = 1,-/V = 40 
for two values of </) = (a) 1 and (b) 10. The inset shows a 'top view' of the (771,77,2) plane with 
Xn, indicated in a colour map on a logarithmic scale. 



40 



3.3. Results for a sparse chemistry 




Figure 3.6: Behaviour of A as a function of kf and (p. The figure shows the dependence 
of A on kf and (/> for an uncatalyzed chemistry, keeping A = kr = I, N = 100. The curved 
surface was made with parameter values in the range < kf < 2, 0.01 < < 2. A is found to 
be a monotonically increasing function of kf and a monotonically decreasing function of (/). For 



(f) — 0, there is an analytical solution A — kf (see Eq. (3.1 1). This was verified numerically in 
the range < fc/ < 1 (see solid line at (j) — 0). In the region kf > 1 the numerical integration 
does not converge at = as the steady state solution (Eq. (3.1)) a;„ — AA"^^ 



Ak 



f 



numerically very large for large n. The dotted extension of the line (1 < fcy < 2) is simply the 
analytical result. Note that for most of the phase-space A < 1, except for very small values of <j). 



chemistries, whenever kr ^ 0, the system can be described by two dimensionless 
parameters, k'r and 0'; or whenever 7^ 0, by k'r and k'^. The behaviour of A as a 
function of kf and (p (keeping A = kr = 1) is shown in Fig. 3.6, and as a function 
of kf and kr (keeping A = (p = 1) is shown in Fig. 3.7 

When kr = and = both the above choices for scahng faiL In this case the 



system has no steady state for any finite A^ as xn is always positive (in Eq. (2.8) 
for n = N, the longest molecule has only a positive production term, thus system 
has no steady state). In all other cases the uncatalyzed chemistry seems to have 
a global fixed point attractor (all initial conditions tested lead to the same steady 
state). 



3.3 Results for a sparse chemistry 



As defined earlier (in Section 2.1.1) a sparse chemistry is one that contains only 
a small subset of all possible reactions. There are several ways of constructing a 
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Figure 3.7: Behaviour of A as a function of kf and fc^. The figure shows the dependence 
of A on kf and kr for an uncatalyzed chemistry, keeping A — <j) — 1, N ~ 100. A is found to be 
a monotonically increasing function of kf and a monotonically decreasing function of kj.. 



sparse chemistry; we use the following algorithm: 

Algorithm to generate sparse chemistries: We prune the reaction set to only 
k ligation reactions per molecule, randomly chosen from all possible ligation reac- 
tion producing the molecule. For molecules too small to have k ligation reactions, 
all ligation reactions are retained. (Note that while we refer to only the ligation 
reactions and not cleavage reactions for the purpose of defining the degree of a 
molecule, in our simulations all reactions are treated as reverse reactions. That 
is, for every ligation reaction included in the chemistry the reverse (cleavage) re- 
action is also present.) Thus we define a chemistry with degree k as one in which 
the number of ligation reactions producing a molecule is k obtained as a result of 
above pruning algorithm. 

For a sparse spontaneous chemistry, the plot of steady state concentration Xn 



versus n is shown in Fig. 3^, The steady state concentration of molecules are 
smaller in a chemistry with smaller average degree as compared to fully connected 
chemistry as the number of ways is which a molecule can be produced are fewer. 



3.4 Results for a heterogeneous chemistry 

In the earlier sections we have discussed chemistries in which all forward and 
reverse rate constants take same values, respectively. In real chemistries, the rate 
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Figure 3.8: Concentrations in sparse spontaneous chemistries, (a) Steady state con- 
centration Xn as a function of n is shown on a semi-log plot for sparse chemistries with / = 1 
and degrees, k — 1,2 and 10. Also shown for comparison are the steady state profiles for a fully 
connected chemistry A - 
sparse chemistry with / 



i\i f — nJy-. — 

2 and degree, k = 1. kj 



1, N = 100. (b) Steady state concentration profile for 



^(1,0) 



=(0,1) 



1, 



10, N = 40. 



In comparison to a fully connected chemistry, the steady state concentrations are smaller for a 
sparse chemistry. 
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Ep+E^ 




E.+E, 



(a) 



(b) 



Figure 3.9: Activation energy for a reversible reaction pair. For a reversible reaction 
pair, A + B ^^ C, the activation energy for forward reaction is defined as E^ and for reverse 
reaction E^. Ep^ + E^ represents the energy of the reactants and Eq, energy of product. 
The barrier height is defined as the lower of two activation energies, (a) Ebarrier = E^ . (b) 



Eba 



Ei 



constants depend upon various physical parameters such as the energy of the 
reactants and products, the activation energy of the reaction, etc. The difference 
in the energies of the reactants and the products determine the ratio of forward 
and reverse rate constants. Whereas, their actual value is determined by the 
activation energy of the reaction, and is given by 'Arrhenius' equation. 



k = Ze 



-Ea 



(3.5) 



where, Z is the pre-exponential factor and Ea is the activation energy of the 
reaction in the units of fcsT. 

For a reversible reaction, A + B ^ C, the two rate constants are given as. 






ZiC 



' '^AB 



Zoe" 



-eR 



(3.6) 



where, E^ {E^) is the activation energy for the forward (respectively, reverse) 
reaction. The difference between E^ and E^ is equal to the difference between 
the energy levels of the reactants and the products (as defined for the forward 
reaction), i.e., E^ — E^ = Eq — {Ea + Eb). Ex is the energy of molecule X. For 
every reaction pair we define barrier height as the smaller of the two activation 



energies (Fig. 3.9). 

We employ following scheme to assign reaction rates. We first assign each 
molecule an energy. For a molecule of length n = 2,3,4, .. ., the energy. En, is 
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given as, 

En = nei ± r]ie2, (3.7) 

where, ei is the energy of the monomer, Ei = ei, and 62 is the scale of variation 
in energies. All energies are expressed in units of ksT. rji is a random number in 
[0, 1]. We then assign barrier height to each reaction A + B ^ C as 

Ebarrier = V2^3, (3-8) 

where, €3 is the scale of the barrier heights and 772 is a random number in [0, 1]. 
Ebarrier IS uscd to determine the forward and the reverse activation energies using 
the energies of the reactants and the products as. 



if Ea + Eb < Ec, E^ = {Ea + Eb) + Ebarrier, E^ = Ebarrier (see Fig |3.9a[ ), and 
if Ea + Eb > Ec, E^ = Ebarrier, E^ = {E^ + Eb) + Ebarrier (sec Fig 3.9b). 



The reactions rates are calculated using Eq. (3.6). Thus the reaction rates are 
determined by the parameters ei, e2, €3, Zi, Z2, and the random numbers rji (one 
for each molecule) and ri2 (one for each reaction). 

The Steady state concentration profile for a heterogeneous spontaneous chem- 



istry with / = 1 is shown in Fig. 3.10a It shows that even in the presence of a 



small amount of heterogeneity in the system, while there are ups and downs in the 
steady state concentration profile, the steady state concentrations of molecules still 
generally fall exponentially with their lengths. We still find a single global fixed 
point attractor. The energies assigned to the molecules are shown in Fig. 3.10b| 



and the forward and reverse reaction rates in Fig. 3.10c and 3.10d respectively. 

We remark that instead of the above scheme for choosing k^B and k^B if ^^ 
simply choose k^^ = kf + 77^^, /c^g = kf + r/^g where 77^3 and r/J^g are random 
numbers drawn uniformly from the range [—rj, 77] for some small t], then a behaviour 



similar to Fig. 3.10a is obtained. 



3.5 Summary 

We show that for the model presented in Chapter |2| in presence of buffered food 
set molecules, the concentrations of large molecules produced via a spontaneous 
chemistry decline exponentially with the size of the molecule. This system is 
found to have a global fixed point attractor. We also discussed chemistries that 
are sparse and heterogeneous. We find this qualitative feature is robust to small 
heterogeneity present in the chemistry and also holds for sparse chemistries. It 
may be interesting to relate this model to work on monotone dynamical systems 
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Figure 3.10: Heterogeneous spontaneous chemistries with single food source. Con- 
tinued. 
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Figure 3.10: Heterogeneous spontaneous chemistries with single food source. The 

forward {k[A and reverse (kj^) reaction rates for every reaction (A(i) + A(j) ^ A{i + j)) were 
assigned using parameter values ei — 62 ~ £3 = Zi = Z2 = 1- (a) Steady state concentration 
Xn as a function of n. Inset shows the same on a semi-log plot. A = cj) = 1, N = 100. (b) 



Energy, E^ of the molecule of length n assigned using Eq. (3.7). The value of (c) kf^ and (d) 



kf!j indicated in a color map on a logarithmic scale. The solid black line in (c) indicates all the 
forward rate constants for the reactions which produce A(80) {i + j = 80). 
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177, 178 . In subsequent chapters we discuss the role of catalysis in production of 



large molecules. 
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Chapter 4 

Catalyzed chemistries and 
autocatalytic sets 



THIS chapter discusses the model when the chemistries contain some catalyzed 
reactions in addition to the spontaneous reactions. We study the dynam- 
ics in presence of 'autocatalytic sets' (ACSs). We begin by defining ACSs and 
then discuss various dynamical properties exhibited by the chemistries containing 
them including the existence of mult ist ability. We show how a small subset of 
molecules can be produced in large concentration (compared to other molecules 
in the chemistry) in presence of an ACS. 

4.1 Autocatalytic Sets (ACSs) 

Consider a set 5* of catalyzed one-way reactions. 'One-way' means that each 
reaction in S is either a ligation or cleavage reaction. Thus the set of reactants 
and the set of products are unambiguously defined for each reaction and the two 
sets are distinguished. The presence of a given ligation or cleavage reaction in 
S does not mean that its reverse is also necessarily a member of 5*. Let P{S) 
be the union of sets of products of all reactions in S, and R{S) the union of 
sets of reactants of all reactions in S. We exclude the food set molecules from 
both P{S) and R{S). We will refer to the set S of catalyzed reactions as an 
autocatalytic set (ACS) if (a) P{S) includes a catalyst for every reaction in S, 
and (b) R{S) C P{S). The latter condition implies that all members of R{S) 
can be produced from the food set by (recursively) applying reactions from within 
S. An ACS thus ensures the existence of a catalyzed pathway, starting from the 



food set, for the production of each of its products 155-157 . Alternative valid 
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definitions of an ACS can be given (see 91,93 for one such); the above definition 
suffices for our present purposes. 

Note that if S is an ACS, then its extension, S', that additionally includes the 
reverse of some reactions in S, is also trivially an ACS, as in our scheme a catalyst 
works for both forward and reverse reactions if both exist in the chemistry. 



4.2 An example of chemistries with autocatalytic 
sets 

As a specific example to display certain generic properties, we consider the fol- 
lowing catalyzed chemistry (for a diagrammatic representation of the network 



topology see Fig. 4.1) 



A(9) 



(4.1a) 
(4.1b) 
(4.1c) 
(4.1d) 
(4.1e) 
(4.1f) 

(4.1g) 
(4.1h) 



Note that this set of reactions constitutes an ACS (which we will refer to as 
ACS65). If any one reaction pair is deleted from the set, it is no longer an ACS. For 
the moment, for simplicity, we consider the case where the catalytic strengths of 



A(l) + A(l) 




A(2) 


A(2) + A(2) 


A(5) ^ 


A(4) 


■n. 


A(l) + A(4) 


A(28) ^ 


A(5) 


A(4) + A(5) 


A(14) ^ 


A(9) 


A(5) + A(9) 


A(37) ^ 


A(14) 


A(14) + A(14) 


A(37) ^ 


A(28) 


A(9) + A(28) 


A(65) -^ 
\ 


A(37) 


A(28) + A(37) 


A(14) ^ 
\ 


A(65) 



all the catalyzed reactions are equal ('homogeneous' catalytic strengths): k. 



1,1 



.2,2 



1,4 



4,5 



5,9 



14,14 



9,28 



28,37 



Ke: — ^^28 — ^14 — ^37 — ^^37 — ^65 — ^14 — ^' anci all otuer t^^jj — 
0. (For clarity, in view of double digit indices, we have introduced a comma 
between the pair of indices in the superscript.) Fig. 4.2a describes the steady 



state concentrations, starting from the standard initial condition (Eq. (3.3)), for 
the chemistry that contains these eight catalyzed reactions in addition to all the 
reactions of the fully connected spontaneous chemistry. At k = 2.5 x 10^ the ACS 
product molecules 'dominate' over the background (the 'background' being defined 
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Figure 4.1: Pictorial representation of the catalyzed chemistry in Eqs. (4.1), re- 



ferred to as ACS65. This is a directed bipartite graph with two types of links. Circular 
nodes represent molecules and rectangular nodes represent reactions. The label inside the nodes 
identify the nodes (molecule size n for circular nodes and reaction sub-equation label of Eq. 



(4.1) for rectangular nodes). A black solid arrow from a molecule to a reaction node indicates 
that the former is a reactant in the latter, and one from a reaction to a molecule node that 
the latter is a product of the former. A red dashed arrow from a molecule to a reaction node 
indicates that the former is a catalyst for the latter. To avoid visual clutter some black arrows 
starting from molecule nodes are shown to branch out into more than one arrow. (For example, 
the arrow from molecule node 5 branches into reaction nodes d and e; this means that molecule 
5 is a reactant in both reactions. This structure should not be construed as a bi-directional 
link between reaction nodes d and e.) The figure only represents the ligation reactions in the 
catalyzed chemistry; the reverse (cleavage) reactions are not shown. 
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as the set of all molecules except the ACS product molecules and the food set), in 
the sense that the ACS molecules have significantly larger populations than their 



'neighbours', namely, the background molecules of similar size 159 . There is a 



fairly sharp threshold value of k above which ACS domination appears, as evident 
from the comparison with the lower curve in Fig. 4.2a drawn for n = 2.0 x 10^. 



Time evolution of concentrations for a few ACS and background molecules is 



shown in Fig. 4.2b As the right catalysts and/or reactants become available the 
concentrations of ACS molecules jump sharply to higher values. 



Fig. 4.3 shows that the steady state background concentrations decline as 
increases, while the ACS concentrations are relatively unaffected in this regime 
(thus ACS domination increases). If catalyzed production pathways from the 
food set to other molecules are broken somewhere, the concentration of the latter 



molecules declines significantly. This is evident from Fig. 4.4 for which only one 



reaction pair (4.1a) is deleted from the catalyzed chemistry (which now contains 



no ACS) while others are catalyzed at the same strength as before. 

ACS domination at a sufficiently high catalytic strength also occurs when there 
is more than one monomer. An example with / = 2, referred to as ACS (8, 10), is 
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Figure 4.2: Steady state concentration profile for ACS65 (Eqs. (4.1)). kf = h 



A = 1,0 = 15, A^ = 100. (a) The steady state concentration profile for two values of n,. There 

exists a threshold value of k above which ACS dominates the background. For k — 2.5 x 10^ 

ACS molecules have significantly higher concentrations as compared to their neighbours. For 

a lower value of k (= 2 x 10^) this phenomenon disappears and the system behaves as if it 

only contains spontaneous reactions, (b) Evolution of concentration with time for some ACS 

molecules (n — 2, 4, 5, 28, 65) and background molecules (n — 10, 20) is shown for k = 2.5 x 10^. 

The concentration of ACS molecules sharply rise to higher values as and when right catalysts 

and/or reactants get produced in sufficient concentrations. The dotted line shows the time 

evolution of A(2) for k = 2 x 10^. 
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Figure 4.3: Steady state concentration profile for ACS65 for different values of (p. 

The concentration profile for four values of = 1,5, 10, 15 is shown for k — 3.0 x 10^. As (f> 
increases the concentration of molecules in background decreases, though the concentrations of 
ACS molecules remains largely unaffected. 



4.2.1 Understanding why ACS concentrations are large (the 
Av — > oo limit) 

The above features are generic for a large class of ACSs. It is instructive to consider 
the K — )■ oo limit which we discuss analytically. We discuss the case when / = 1 
but the argument can be extended to a general /. When k, is nonzero, the terms 



in Eq. (2.5) corresponding to catalyzed reactions get modified. The net flux v of 



such reaction pairs on the r.h.s. (for brevity we are omitting the subscript ij in Vij) 
is replaced by (1 + i^J2m-^rn)v, where the sum over m is a sum over all catalysts 
of the reaction pair. Now let the set S of catalyzed reactions be an ACS. Then, 
if A{n) G P{S) the r.h.s. of x„ contains at least one such catalyzed term, while 
if A{n) ^ P{S) Xn contains no such term. For example, for the above mentioned 
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Figure 4.4: Steady state concentration for ACS65 with reaction (4.1a) deleted. 

kf — kr — A — 1, (/) — 15, N — 100. The concentration profile for k ~ 2.5 x 10^ but with reaction 



(4.1a) removed from the ACS65 (red curve) compared with the profile for the spontaneous 
chemistry, k = (green curve). The inset shows the same with Xn on a logarithmic scale. On 
the linear scale the two curves are indistinguishable. 
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Figure 4.5: Steady state concentration profile for ACS(8,10) in a chemistry with 

/ = 2. The 3D plots show the steady state concentration Xn of the molecules n = (rii, 712) as a 
function of ni and n2- The inset shows a 'top view' of the (ni, 712) plane with x^ indicated in a 
colour map on a logarithmic scale, kj = kr — 2^(1,0) — 2^(0,1) = Ij = 10, k = 10®, N — 40. 
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ACS, ACS65, we have 

X2 ~ ^(xgf 1^1 — 2a;5t;2,2) + (terms independent of k) (4.3a) 

Xi ~ K,{x5V2,2 - X28V1A " a;i4t^4,5) + (f^^ terms) (4.3b) 

is ~ K,{x2sVi,A - Xi4V4^5 - XsjV^^g) + {k^ terms) (4.3c) 

xg ~ fi;(xi4W4,5 - 3:37^^5,9 - a;65^^9,28) + (/t° terms) (4.3d) 

i;i4 ~ K(x37f5^9 — 2x37^14^14) + (fi:° terms) (4.3e) 

±28 - /t(x37t;i4,i4 - a;65t^9,28 " a;i4t^28,37) + (/t° terms) (4.3f ) 

±37 ~ /t(a;65^^9,28 " a;i4^^28,37) + (/«° terms) (4.3g) 

±65 ^ «(a;i4f 28,37) + («° terms), (4.3h) 

while the rate equations for all other (non ACS) molecules (±3, xq, etc.) have 



no terms proportional to k. In a steady state solution the r.h.s. of Eqs. (4.3) is 
zero, and to leading order in the k — )■ 00 limit we must set the coefficients of n to 
zero. The coefficients involve only the ACS fluxes Vij and catalyst concentrations. 
Each coefficient is a sum of terms, and each term is proportional to an ACS 
flux Vij. Thus Vij = for the ACS fluxes provides a steady state solution in 
the K — !■ 00 limit. Numerically we flnd that when k, is sufficiently high the rate 
equations converge to this solution starting from the standard initial condition. 
Now Vij = kfXiXj — krXi^j, therefore Vij = implies Xj+j = kfXiXj/kj. for the 
members of P{S). Since by definition there is a catalyzed pathway from the food 
set to every ACS product, we can recursively express the steady state concentration 
of every ACS molecule in terms of Xi = A: x„ = AikfA/kj.)"^'^. 

It is evident that this argument applies whenever the set S of catalyzed reac- 
tions is an ACS; thus for every member of P{S), x„ ~ A{kfA/kr)"'~^ is a steady 
state solution of the rate equations in the limit k — > 00. This is corroborated 



numerically: in Fig. 4.3 since A = kf = kj- = 1, all the eight ACS products should 
have x„ = 1 in this limit; the numerical result at k = 3 x 10^ is not too far from 
this limiting analytical value. 

A strong ACS counteracts dissipation 



Recall from Eq. (3.1) and the discussion in Section 3.1.1 that every molecule in 
a homogeneous connected uncatalyzed chemistry has the steady state concentra- 
tion Xn = AlkfA/kr)""'^ when there is no dilution flux or dissipation (0 = 0), 
and a smaller concentration when there is dissipation (0 > 0). We have observed 
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4.2. An example of chemistries with autocatalytic sets 

above that an ACS with a sufficiently large k can boost the steady state con- 
centrations of its members, even when > 0, to the same level. The expression 
Xn = A{kfA/kr)^~^ seems to represent an upper limit on the steady state concen- 
tration of A{n), which can be approached either when dissipation goes to zero, 
or, when there is dissipation, by membership of an ACS whose catalytic strength 
becomes very large. 

When the reaction pair A(l) + A(l) ^ A(2), in ACS65, is not catalyzed 
the production of A (2) takes place at a much smaller rate, the spontaneous rate. 
Therefore its concentration is much smaller, and hence so are the concentrations 
of the larger molecules. 

When A(n) belongs to the background the r.h.s. of Xn contains no term 
proportional to k, and all the ^-independent terms have to be kept, including 
the (j)Xn term. Thus its steady state concentration depends upon 0, and as in the 
case of the uncatalyzed chemistry, declines more rapidly with n when increases. 

4.2.2 Multistability in the ACS dynamics and ACS domination 

The reason for the sudden change in the qualitative character of the steady state 



profile as k is increased (as shown in Fig. 4.2a) is a bistability in the chemical 
dynamics due to the presence of the ACS. Fig. 46 shows three regions in the 
phase diagram of the chemistry containing ACS65, separated by values k^ and 
K^^ of K. For < K, < K,^ (region I), the dynamics starting from both the initial 
conditions mentioned in the figure caption converged to the same attractor con- 
figuration, which is a fixed point in which the large ACS molecules have a very 
small concentration which are comparable to the background concentrations (the 
concentration declines exponentially with n). For n^^ < n (region III), again they 
converge to a single attractor, a fixed point in which the ACS molecules have a 
significant concentration which approaches x„ = A{kfA/kr)^~^ as k — t- oo while 
the background molecules have significantly lower concentration. In the range 
K,^ < K < n^^ (region II), they converge to two different stable attractors, both 
fixed points for the ACS under discussion. 

This phase structure implies that if we start from the standard initial condition 
and consider the steady state profile to which the system converges for different 
values of k, we will see a sharp change in the steady state profile as k, is increased 
from a value slightly below k,^^ to a value slightly above k,^^ . Below k^^ the large 
ACS molecules will be essentially absent in the steady state, and above k^^ they 
will be present in large numbers and will dominate over the background. 
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Figure 4.6: Bistability in the dynamics of ACS65. 'Hysteresis curve' of the steady state 



concentration of A(65) versus k for k 



f 



A = 1,< 



15, A^ — 100. The curve is obtained 



by using two different initial conditions (i) the standard initial condition a;„ = for all n > 2, 
and (ii) a 'high' initial condition x^ = 1 for all n > 2. In region I {k < k^ — 6617) both 
initial conditions lead to a single fixed point in which xq^ is very low, 10~^°. In region III 
(k > K^' = 2226342) both initial conditions again lead to a single fixed point but in this fixed 
point xgs is high, close to unity. In region II (k^ < k < k^^) the initial condition (i) leads to 
the lower fixed point and (ii) leads to the upper one. The transitions are very sharp, e.g., at 
K = 2226341 the system is numerically clearly seen in region II and at 2226343 in region III. 



58 



4.3. Bistability in simple ACSs 



Therefore, following the nomenclature of Ohtsuki and Nowak 169 , who ob- 
served a similar bistability in their model with a single catalyst, we refer to k^^ 
as the 'initiation threshold' of the ACS. Similarly k^ will be referred to as the 
'maintenance threshold' of the ACS, because once the ACS has been initiated, k 
can come down to as low a value as k'^, and the ACS will continue to dominate. 

We remark that using other initial conditions we have found at least one more 
stable fixed point in a part of region II which has intermediate values of Xes (Fig. 



4.7a indicating that this system has multistability. The steady state concentration 



profile for a system in the intermediate attractor is shown in Fig. 4.7b An intuitive 



reason for appearance of this attractor is discussed in Appendix |D} 

4.3 Bistability in simple ACSs 

In general k^ and n^^ depend upon the other parameters, as well as the topology 
of the catalyzed and spontaneous chemistries. The phase structure is exhibited 



in more detail for a simpler example in Fig. 4.8, where the catalyzed chemistry 
consists of only two reaction pairs: 

"'*' - ■ '"- (4.4a) 



A(l) + A(l) 


^-^ A(2) 


A(2) + A(2) 


^^^ A(4) 



(4.4b) 

which constitute an ACS (called ACS4). This system, investigated numerically 
using XPPAUT, shows bistability. For a fixed (j) the bistability diagram is shown 



in Fig. 4.8a The dependence of k,^,k,^^ on is exhibited in Fig. 4.8b For a 
given kf, there is critical value of 0(= 0) at which the k^ and k,^^ curves meet, 
below which there is no bistability. The locations of the phase boundaries, the k^ 
and K^^ curves, depend upon the specific underlying chemistries (catalyzed and 
spontaneous) as well as the ACS topology. The steady state profiles are shown at 



sample points in the phase space in Fig. 4.8c For > it can be seen, that as in 



the case of the larger ACS discussed earlier, if we start from the standard initial 
condition, the largest molecule of the catalyzed chemistry, here A (4), dominates 
over the background in the steady state only for k, > n^^ (e.g., the panel marked 3 



in Fig. 4.8c). In the range k^ < n < n^^ , it dominates only if we start from initial 
conditions where it has a large enough value to begin with (panel 2b), but not 
if we start from the standard initial condition (panel 2a). It does not dominate 
for any initial condition ii k < k^ (panel 1). If is below 0, there is a single 
attractor with no significant ACS dominance if k is small (panel 4), or if k is large 
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Figure 4.7: Multistabihty in the dynamics of ACS65. (a) Same as Fig. 

additional stable state in Region II shown. This steady state is achieved by initial conditions 



that are intermediate to the ones discussed in Fig. 4.6 (b) The steady state concentration 
profile in the intermediate steady state, kf — kr = A = 1, (p = 15, k = 5 x 10^, A^ = 100. The 
initial condition used is Xi = 10^^ ior i = 2,3, . . . ,N. 
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4.3. Bistability in simple ACSs 

(panel 5), ACS dominance exists but is not very pronounced as the background 
concentrations are also substantial. The dependence of k^, k^^ on both cf) and kf 
is shown Fig. 4.8d For a smaller /cj, bistability appears at smaller values of 0r] 



We remark that while bistability seems to be quite generic in homogeneous 
chemistries containing ACSs, the existence of an ACS does not guarantee that 
bistability exists somewhere in phase space. For example consider the simplest 
possible chemistry (A^ = 2) containing only the monomer (which is buffered) and 
the dimer. If we assume that the sole reaction pair A(l) + A(l) ^ A(2) is 
catalyzed by A(2), the catalyzed chemistry is trivially an ACS and the only rate 
equation is X2 = kfA^{l + HX2) — krX2{l + 1^X2) — 0X2- The system can be solved 
exactly and always goes to a global fixed point attractor starting from any initial 
condition 0:2(0) > 0. However, the A^ = 3 chemistry defined by the two catalyzed 
reactions 

A(l) + A(l) ^^^^ A(2) (4.5a) 

A(l) + A(2) ^^^ A(3), (4.5b) 



does exhibit bistability at a sufficiently large 0. Ohtsuki and Nowak 169 had also 
found a lower limit on catalyst size for bistability to exist in their model. Similar 
results hold for the / = 2 case. From our simulations a general observation seems 
to be that bistability is ubiquitous at sufficiently large values of (p in homogeneous 
chemistries whenever the smallest catalyst is large enough compared to the food 



^A technical remark on the definition and calculation of k^ and k^^: We have defined k^ and 
K^' as the values of k that mark the two extremes of the bistable region. To calculate the limits 
of bistability, for each value of k, we start from two initial conditions, (i) the standard initial 
condition, a;„ = V n > 2, and (ii) a 'high' initial condition, a;„ = 1 V n > 2, and integrate the 
system to identify the respective attractors these initial conditions lead to. k^^ is the value of 
K upto which the system remains in the lower attractor starting from the initial condition (i), 
and, K^ is the value of k at which the system first moves to the higher attractor starting from 
the initial condition (ii). Alternatively, we can define k^ and k^^ by using an initial condition 
which tracks the attractor at a neighbouring value of k, i.e., the steady state concentrations 
calculated at a k value are used as the initial condition to calculate the attractor for kzL S (for 
small 6). Starting from the lower attractor and using initial conditions that track the attractor 
as K is increased, one can define k^^ as the largest value of k for which the lower attractor exists. 
Similarly, starting from the higher attractor and using initial conditions that track this attractor 
as K is decreased, one can define k^ as the lowest value of k for which the higher attractor exists. 
XPPAUT uses the latter definition of k^ and k^^ . In principle the values of k^ and k^^ obtained 
using the two approaches can be different. For the case of ACS4, these two values happen to 
be same at least upto 4 significant figures. The definition of k^^ that we have followed based on 
the standard initial condition is more relevant for the origin of life problem. 
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Figure 4.8: Phase diagram and concentration profiles for ACS4. Continued. 
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Figure 4.8: Phase diagram and concentration profiles for ACS4. Continued. 
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Figure 4.8: Phase diagram and concentration profiles for ACS4. (a) The steady state 
concentration X4 versus k ior kf = kr = A — l,(j) = 25, N = 15. The bistable region exists for 
the range k^ < k < k^^ in which different initial conditions lead to two distinct steady state 
values of X4. The solid black curves correspond to the two stable fixed points, and the dotted 
black curve to the unstable fixed point, (b) The dependence of k^ (red curve) and k^^ (blue 
curve) on 4> for kf = kr = A = 1^ N = 15. The bistable region lies between the two curves; in 
the rest of the phase space the system has a single fixed point. The inset shows the location of 
the critical point (k, (/)); there is no bistability for <f> < (j). (c) The steady state concentration 
profile of molecules shown at five representative points in the phase space (numbered 1 through 
5 and marked in (b)). Note that at the phase point 2 that lies between the k^ and k^^ curves 
there are two steady state profiles corresponding to the two stable fixed points of the system. 
The figure marked 2a shows the profile starting from the standard initial condition, and 2b from 
the initial condition where Xn = 1 for all n. (d) Dependence of the phase boundaries on kf for 
kr = A = 1,N = 15^ with the inset showing the behaviour on a log-log plot. 
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4.4. ACSs and sparse chemistries 

set. When it does exist it seems to provide a crisp criterion for 'ACS domination', 
including 'initiation' (/t > n^^) and 'maintenance' (k > n^). 

ACSs with the same set of reactions (and hence the same product set P{S)) 
may differ from each other in the assignment of catalysts to the reaction. We 
remark that the assignment of catalysts to the reactions of an ACS affects the 
dynamics, specifically the existence and extent of bistability. A detailed discussion 
of an example of how the assignment of catalysts effects the dynamics is given in 
Appendix [Ej 

There exists a substantial literature including results on the conditions for 



bistability in chemical systems 179 181 . It would be interesting to apply some 



of those results to models of the kind being studied here, which involve a large 
number of molecular species. 

4.4 ACSs and sparse chemistries 

We now discuss the dynamics of ACSs when the underlying chemistry is not 
the fully connected chemistry discussed as far, but is a sparse chemistry. We 
generate the sparse spontaneous chemistries with the following modification to the 
algorithm described in Section |3.3[ For every reaction in the catalyzed chemistry 
we include the same reaction (without the catalyst) as part of the spontaneous 
chemistry. Thus if a molecule is produced in m catalyzed ligation reactions, it is 
automatically also produced in m spontaneous ligation reactions. If m is less than 
the degree of chemistry, k, another k — m spontaneous reactions are chosen to be 
added randomly from the remaining ligation reactions producing this molecule. 
The steady state concentration profiles for ACS65 with spontaneous chemistries 



with degree k = Q and A; = 10 is shown in Fig. 4.9 As sparseness increases {k 
decreases) the background concentrations decline. This happens as there are fewer 
pathways to produce the background molecules. There is also a larger variation 
in their concentrations because their production reactions are chosen randomly, 
and background molecules produced in reactions that happen to involve the ACS 
molecules as reactants do better than others. The ACS molecules are seen to 
dominate more strongly over the background in sparser chemistries; this is be- 
cause there are a fewer production pathways to the background that drain their 
concentrations. Similar results are also seen in chemistries with / = 2. 

Sparseness can also effect the value of k^^ . As the chemistry becomes more 
sparse, the number of ways in which a molecule is produced and hence its concen- 
tration in the steady state accessible from the standard initial condition decreases. 
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If a molecule that acts as a catalyst is not produced in sufficient concentration 
that can cause the initiation of the ACS domination to happen at a larger value 
of K. 

4.5 ACSs and heterogeneous chemistries 



In Section 3^, we discussed the effect of heterogeneity in the rate constants on the 
steady state concentrations in a spontaneous chemistry. For the same chemistry 
(rate constants) we study the effect of inclusion of catalysts. The steady state 
concentration proffie for ACS65 with heterogeneous rate constants is shown in 



Fig. 4.10 Again, the molecules produced in the ACS have larger concentrations. 
The maximum concentration that any ACS member achieves is limited by its 
concentration in the equilibrium case of = 0. When > 0, members of the ACS 
have concentrations very close to their limiting values, whereas the background 
molecules have much lower values in comparison. 

4.6 Summary 

We have shown in this chapter that in the presence of an ACS at sufficiently large 
values of k, a small subset of molecules that constitute the ACS can dominate the 
chemistry, i.e., have much larger concentrations as compared to the background 
(which consists of other molecules in the system produced via spontaneous reac- 
tions). We present an analytical reasoning, in the k — ?■ oo limit, for the domi- 
nation of ACS molecules. The chemistries which contain ACSs usually exhibit a 
rich dynamical behaviour such as the existence of mult ist ability. We find that the 
qualitative features of these results are robust to a small heterogeneity present in 
the system and also hold for sparse chemistries. This provides a mechanism of 
'chemical selection' that allows a subset of molecules in a large set of molecules 
(comprising the whole chemistry) to dominate over the rest. We now turn to the 
question: can we build a sparse subset containing large molecules using ACSs? In 
the following chapter we discuss the problem one encounters in production of large 
molecules using ACSs and provide a solution to the problem in later chapters. 
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Figure 4.9: Steady state concentration profile of ACS65 with sparse spontaneous 
chemistry, (a) The steady state concentrations, a;„ versus n for ACS65 is shown for sparse 
spontaneous chemistries with fc = 6 and 10. For comparison the steady state profile for a fully 
connected spontaneous chemistry is also shown. A = A;/ = /c^ = 1, = 15, k = 2.5 x 10^. (b) 
Same as (a) with concentrations on logarithmic scale. 
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100 



Figure 4.10: Steady state concentration profile for ACS65 in a heterogeneous chem- 
istry. The rate constants for the chemistry were generated using ei = 62 = €3 = Zi = Z2 = 1 



(the parameters and the random seed used to generate the chemistry are same as in Fig. 3.10). 
The red curve plots the steady state concentrations for the heterogeneous chemistry at cj) ~ 0. 
(The concentrations aX cj) — are independent of presence of an ACS). The blue curve shows 
the steady state concentrations of the molecules when ACS65 is present in the chemistry, here 
(f> = 15 and k = 3.5 x 10^. The maximum concentration ACS members are able to achieve is 
still govern by their steady state concentration value at = 0. In both cases A — 1. 
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Chapter 5 

Using ACSs to produce large 
molecules: The problem 



WE have seen that in uncatalyzed chemistries, the concentrations of the large 
molecules remain exponentially small (x„ ~ e~'^",7 > 0). In catalyzed 
chemistries, especially in the presence of an ACS, a few specific large molecules 
produced by the ACS can acquire a high population. However, this seems to 
require a large catalytic strength for the catalysts. For example, for ACS65 this 



happens at k > n^^ = 2226342 (see Fig. 4.6), starting from the standard initial 
condition. The fact that such a large catalytic strength is needed to produce 
appreciable concentrations of molecules of even moderate length like n = 65 could 
be a problem for the ACS mechanism to produce large molecules in the kind of 
prebiotic scenario we are considering. In this chapter we characterize the problem 
somewhat more quantitatively by determining how n^^ depends upon the size n 
of the catalysts in the ACS. 

5.1 Scaling of k^ and ^c^^ with the size of the catalyst 

The values of k^, k,^^ for an ACS (for a fixed set of parameters) depend upon its 
topology. The topology of the ACS includes the set of catalyzed reactions and the 
assignment of catalysts to each of the catalyzed reactions. We define the 'length' 
L of an ACS as the size of {i.e., the total number of monomers of all types in) 
the largest molecule produced in the ACS. An 'extremal' ACS of length L will 
be referred to as one in which all reactions belonging to the ACS are catalyzed 
by the same molecule which is the largest molecule (of size L) in the ACS. For 
concreteness, since we are interested in the dependence of k'^ on the catalyst size, 
we consider only extremal ACSs of length L. We assume that the catalyst has the 
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same catalytic strength k, for all the reactions in the ACS. We wish to determine 
the bistable region for such ACSs and in particular how the values of k,^ and k,^^ 
depend upon L. These values depend upon the precise set of catalyzed reactions 
constituting the ACS. For illustrative purposes we consider three different ways 
of generating the ACS described below as Algorithm 1, 2 and 3, which generate 
ACSs with different characteristic structure. 

5.1.1 Generating extremal ACSs of length L 

We describe three different algorithms used for generating a set of reactions that 
provide a pathway to produce a molecule of a given length L from the food set. 

Algorithm 1: Incremental, smallest-stepsize, deterministic construction. 

Each molecule of size n (n = 2, 3, . . . , L) is produced in the reaction A(l) + 

A(n — 1) — 7> A(n). All such reactions for n = 2, 3, . . . , L are included in the 

ACS. 

Example: For L = 5 following reactions are included 

A(1) + A(1)^A(2) 
A(1) + A(2)^A(3) 
A(1) + A(3)^A(4) 
A(1) + A(4)^A(5) 

Algorithm 2: Shortest path, top-down, deterministic construction. Start 
with A(L). If L is even, it is produced in the reaction A(L/2) + A(L/2) — )■ 
A(L). If L is odd, say L = 2m + 1, then A(L) is produced in the reaction 
A{m) + A(m + 1) — )■ A(L). The same algorithm is used to select a reaction 
for the production of the precursor(s) of A(L) (namely, for A(L/2) if L is 
even, and for each of A{m) and A{m + 1) if L is odd (= 2m + 1)), and is 
iterated for their precursors, etc., until the only reactant appearing in the 
reactions is the food set molecule, A(l). All the production reactions men- 
tioned above are included in the ACS. 
Example: For L = 10 following reactions are included 

A(5) + A(5) ^ A(10) 
A(2) + A(3) ^ A(5) 
A(1) + A(2)^A(3) 
A(1) + A(1)^A(2) 



70 



5.1. Scaling of k^ and k^^ with the size of the catalyst 

Algorithm 3: Incremental, random construction. In this method, starting 
from the food set 5" the set of reactants Rk is sequentially enlarged step by 
step {3^ = Ro C Ri C R2 C . . .) to include larger molecules and a reaction 
chosen randomly until a product of length L is obtained. Rk denotes the 
set of reactants at step k in the algorithm, and Lk the size of the largest 
molecule in Rk. At step k, pick a pair of molecules (say X and Y) randomly 
from Rk, and determine the product Z formed if they were to be ligated (X 
being the same as Y is allowed). If the size of this product, L{Z), is < Lk 
or > L, discard the pair and choose another pair. If Lk < L{Z) < L, add 
the ligation reaction X + Y — > Z to the ACS, define -R^+i = Rk^ {Z}; ^^d 
iterate the procedure. If L{Z) = L, add the ligation reaction X + Y — )■ Z 
to the ACS, and stop. Initially {k = 0) the reactant set is just the food set, 
Rq = 5". 

To complete the extremal ACS, we assign A(L) as the catalyst of all the reac- 
tions generated using any of the above algorithms. For the simulations reported in 
the paper, the reverse of each reaction is also included as a reaction catalyzed by 
the same catalyst. For concreteness we have described the Algorithms 1 and 2 for 
the single monomer case; their generalizations to / = 2 have also been considered 
by us. Algorithm 3, as described above, can be used for any /. 

5.1.2 The requirement of exponentially large catalytic strengths 

We constructed ACSs of different values of L using the above three algorithms 



and determined the k^ and n^^ values numerically. These are plotted in Fig. 5A_ 
It is evident that k,^ increases with L according to a power law 

K^ ~ L° (5.1) 

with a ranging from 2.1 to 2.8 for the three algorithms, while k^^ increases expo- 
nentially, 

k'' ~ e"-^, (5.2) 

with p ^ 0.64 for all the algorithms, a and p depend upon the other parameters. 
In particular we find that p increases with 0, i.e., the catalytic strength needed for 
large molecules to arise increases faster with the size of catalyst at larger values 
of dissipation. This generalizes, to a much larger class of models, the results of 



Ohtsuki and Nowak 169 , who found a linear dependence of k^ on L and an 
exponential dependence of n^^ . 
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Figure 5.1: The dependence of the bistable region on catalyst length L. (a) The 

dependence of k^ on L. (b) The dependence of k^^ on L. Simulations were done for extremal 
ACSs of length L generated by three algorithms discussed above, represented in the figure by 
different colours. For each L the ACS in question has the property that the largest molecule 
produced in the ACS has L monomers and catalyzes all the reactions in the ACS. All simulations 
were done ioi kf = kr — A = (j) = 1. N = 100 in all cases except the k^ curve for Algorithm 
1, where N = 200, because in this case 'finite- A^' effects were quite significant at A^ = 100. The 
figures suggest an approximate power law growth of k^ and exponential growth of k^^ with L. 
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5.1.3 Why the initiation threshold is exponentially large 

The contribution of a catalyst to the rate of the reaction it catalyzes appears 
through the factor 1 + kx, where k is the catalytic strength of the catalyst and x 
its concentration. The term unity in the above factor is the relative contribution of 
the spontaneous (uncatalyzed) reaction rate. If the catalyst is to play a significant 
role in the reaction, the catalytic contribution to the reaction rate should be at 
least comparable to the spontaneous rate, i.e., nx should be at least comparable 
to unity. As we have seen earlier the concentration of large molecules is typically 
damped exponentially with their size. Therefore the compensating factor k, needs 
to increase exponentially in order for the catalyzed reaction rate to be comparable 
to the spontaneous reaction rate. For concreteness consider the extremal ACSs of 
length L and consider the steady state population xl of the catalyst A(L) in the 
low fixed point as n is increased. In the spirit of this rough argument one expects 
that at the initiation threshold the term k^^xl should be of order unity. In Fig. 



5.2 we display this product for different values of L. Though there is a secular 



decreasing trend with L, this product remains of order unity (Fig. 5.2a) even as 
the individual factors change over several orders of magnitude (Fig. 5.2b). This 
lends numerical support to the above explanation for the exponential dependence 
of K^^ on L. 

5.2 A problem for primordial ACSs to produce large 
molecules 

A natural initial condition for the origin of life scenario is one where only the 
food set molecules, and perhaps a few other not very large molecules (dimers, 
trimers, etc.) have nonzero concentrations, while the large molecules have zero 
concentrations. It is from such an initial condition that we would like to see the 
emergence of large molecules through the dynamics. 

The exponential increase of the initiation threshold, k^^ , with L, quantifies the 
difficulty in using ACSs to generate large molecules in the primordial scenario of 
the type modeled above. This means that one needs large molecules with unrea- 
sonably high catalytic strengths to exist in the chemistry in order to get them to 
appear with appreciable concentrations starting from physically reasonable pri- 
mordial initial conditions. 
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(a) 




(b) 



Figure 5.2: The product k^^xl is of order unity. For this figure each chemistry was 
simulated at a vahie of k equal to the initiation threshold k^^ corresponding to that chemistry, 
and the steady state concentration x^ of the catalyst was determined in the low fixed point 
(starting from the standard initial condition). The parameters values are the same as in Fig. 



5.1 



(a) The product of k'^ and xl as a function of L. (b) x^ versus k^^ on a log-log plot. 
The slopes of the fitted straight lines vary in the range -1.13 to -1.16 for the three algorithms 
(slope — -1 would have meant that k^^X2 is strictly constant). The figure shows that while each 
individual factor k^^ and xl ranges over several orders of magnitude, their product, though not 
constant, is of order unity. 
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Chapter 6 

Nested ACSs: A structure to 
promote large molecules 



WE now discuss a mechanism that may overcome the barrier of large catalytic 
strengths, and may enable large molecules to arise from primordial initial 
conditions without exponentially increasing catalytic strengths. This mechanism 
relies on the existence of multiple ACSs of different sizes in the catalyzed chemistry, 
in a topology such that the smaller ACSs reinforce the larger ones, thereby enabling 
large molecules to appear with significant concentrations without exponentially 
increasing their catalytic strength. 



6.1 Using a small ACS to reinforce a larger one 

To illustrate the basic idea we consider the following simple example where the 
catalyzed chemistry contains only two ACSs, one of length three and the other of 
length eight (which we refer to as ACSS and ACSS, respectively), each generated 
by the Algorithm 2 mentioned in the previous chapter. All reactions of the former 
are catalyzed by A(3) with a catalytic strength n^, and of the latter by A(8) with 
the catalytic strength ks- Thus the two ACSs are: 

A(3) 



A(l) + A(l) ^^^ 


A(2) 


(6.1a) 


A(l) + A(2) ^^^^ 


A(3), 


(6.1b) 
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and 



A(l) + A(l) 
A(2) + A(2) ^ 
A(4) + A(4) ^ 



A(8) 



\, 



X A(2) 

A(8) ^ 



A(8) 



\, 



A(4) 



A(8). 



(6.2a) 
(6.2b) 
(6.2c) 



The catalyzed chemistry consists of the above five catalyzed reaction pairs (we will 
refer to this catalyzed chemistry as ACS3+8). The network topology is pictorially 



depicted in Fig. 6.1a The system also exhibits bistability, and the concentration 



of A(8) in the two fixed point attractors is exhibited in Fig. 6.2 as a function of 
^3 and Kg. 
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Figure 6.1: Pictorial representation of nested ACSs. (a) ACS3+8, defined by Eqs. (6.1) 



and ^^. (b) ACS3+8', defined by Eqs. ^^ and (|63|. Tlie notation is the same as for Fig. 



4.1 



Tlie dashed arrows (catalytic links) are given in two colours, blue and red, to distinguish 
the two ACSs whose catalysts are molecules A(3) and A(8), respectively. Reactions having two 



catalysts are given by two distinct equations in the text (e.g. (6.1a) and (6.2a|), but in the figure 



are represented by a single reaction node with two incoming catalytic links (the reaction node 
is not duplicated to avoid visual clutter). 
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Figure 6.2: Reinforcement of a larger ACS by a smaller one: The case of ACS3+8. 
The figure shows the steady state concentration xg (in colour coding as indicated) for two 
different initial conditions as a function of /C3 and Kg, the catalytic strengths of A(3) and A(8) 
respectively. All simulations were done ioi kf = kr = A = l,(j) = 20, N = 100. The two figures 

(a) and (b) differ in the initial condition of the dynamics, (a) The standard initial condition, 

(b) initial condition a;„ = 1 for all n = 2, 3, . . . , iV. 
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When K3 is small the two pictures in Fig. 6.2 show the usual bistability of ACSS 
along the ks axis. The initiation and maintenance thresholds are k^^ = 1.78 x 10^ 
and Kg = 1145 given by the location of the boundary between the low concentration 
region (blue, Xg ~ 10^'^) and the high concentration region (yellow Xg ~ 1) along 
the Ks axis in Figs. 6.2a and |6.2b respectively. As K3 increases, the initiation 



threshold of ACSS decreases slowly for a while (in Fig. 6.3 for ^3 = 100, ACSS 
dominates at /tg = 1 x 10^), then drops sharply near ^3 = 141. This value of k,^ 
is the initiation threshold of ACSS when ks = 0. When K3 exceeds this value, the 
steady state value of Xg is either high (yellow, Xg ~ 1) or intermediate (orange, 
xg ~ 10^'^), depending upon the value of Kg (see panels corresponding to ^3 = 200 



and 1000 in Fig. 6.3) 



The key point is that the initiation threshold of the larger catalyst depends on 
the catalytic strength of the smaller catalyst. The former plummets sharply when 
the latter approaches the initiation threshold of the smaller catalyst, dropping to a 
much lower value than before (compare the lower limit of the yellow region in Fig. 



6.2a to the left and right of K3 = /t|'^ = 141; the value of Kg^ plunges several orders 
of magnitude from 1.7S x 10^ at k,^ = down to 217S at ks = 141). Starting from 
the standard initial condition, thus, the larger catalyst can acquire a significant 
concentration at a much lower value of its catalytic strength in the presence of a 
smaller ACS operating above its initiation threshold than in its absence. 

6.1.1 Why a small ACS reinforces a larger one: The role of 
the background and spontaneous reactions 

When K, exceeds the initiation threshold for a catalyzed chemistry containing an 
ACS, not only do the steady state concentrations of the ACS product molecules 
rise by several orders of magnitude, but also those of the background molecules 



rise. As an example compare the two steady state profiles of ACS65 in Fig. 4.2a 
which correspond to values of k below and above the initiation threshold. As one 
goes from the lower to the upper curve, the concentration of the ACS members 
of course increases dramatically (as shown by the sharp peaks), but note that the 
concentrations of other molecules not produced by catalyzed reactions also goes up 
significantly. Thus in the chemistry containing two ACSs (ACS3+S) as one moves 



along the K3 axis in Fig. 6.2a and crosses the initiation threshold of ACSS {i.e., 
K3 exceeds k^^ = 141), the concentration of A(S) (a molecule belonging to the 
background of ACSS as its production is not catalysed by ACSS) increases from 
~ 10^^ (blue region) to ~ 10^'^ (orange region). This increase in the concentration 
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Figure 6.3: Concentrations in ACS3+8. (a) Time evolution of concentrations of A(3) 
(shown in red curve) and A (8) (green curve), (b) Steady state concentrations profiles (shown 
up to n = 15). The concentrations are shown for three values of K3 and Kg (nine pairs in total). 



Other parameters take same values as in Fig. 6.2 
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of A(8) by a factor of ~ 10^ makes it easier for ACSS to function and its initiation 
threshold drops by a corresponding factor of about 10^ (from ~ 10^ to ~ 10^). 

This fact highhghts the role of spontaneous reactions in the overall dynamics. 
The background molecules are connected to the ACS through spontaneous reac- 
tions, and if it were not for the latter, an ACS would not be able to push up the 
concentrations of its nearby background. We shall refer to a structure such as 
the one described above containing ACSs of different sizes with the smaller ACS 
feeding into the larger one through the spontaneous reactions as a 'nested ACS' 
structure. 

We remark that the "distance" between the catalysts of the two ACSs (as 
calculated on the network of spontaneous reactions linking the two molecules) is 
pertinent to the point under discussion in this section. If the two ACSs are very 
distant, e.g., instead of two catalysts of size 3 and 8 in the above example we had 
studied catalysts of size 3 and 20, then even though the smaller ACS raises the 
concentration of the larger catalyst, the rise may not be sufficient to allow the 
larger ACS to dominate at reasonable catalytic strengths. 

6.2 The role of 'overlapping' catalyzed pathways in 
nested ACSs 

The above example also serves to highlight some other features of catalyzed 
chemistries containing multiple ACSs. Note that the production pathway of A(8) 



in ACSS (Eqs. 6.2 and Fig. 6.1a) contains one reaction pair in common with 
ACS3, namely the reaction pair A(l) + A(l) ^ A(2). One can consider a situa- 
tion wherein the overlap is greater. E.g., consider the ACS8' defined by 

"™ (6,3a) 



A(l) + A(l) 


^— ^ A(2) 


A(l) + A(2) 


""^'^ ^ \(3) 


\ iV|^^5J 


A(2) + A(3) 


^^'^ ^ \(5) 


\ ^H^J 


A(3) + A(5) 


^^^ A(8) 



(6.3b) 
(6.3c) 
(6.3d) 

Now the set of reactions in ACS3 is a subset of ACS8' (ignoring the catalyst, which 
is different in the two cases). The degree of overlap of the catalyzed reaction sets 
between a pair of nested ACSs makes a difference in the dynamics. Consider, for 
example, the catalyzed chemistry consisting of ACS3 and ACSS', i.e., the set of 
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Figure 6.4: Examples of nested ACS pairs with different degrees of overlap for 

/ = 2. In the three cases the reaction sets have (A) maximal overlap, (B) partial overlap, (C) 
no overlap. The blue and red squares marking the grid points indicate the identity of molecules 
produced in the two ACSs; blue filled squares correspond to the products of the smaller ACS, red 
unfilled squares to those of the larger ACS. The x and y axes denote the number of monomers 
of type (1,0) and (0,1), respectively, in the molecules. The green rhombuses represent the two 
monomers. 



catalyzed reactions given by Eqs. (6.1) and (6.3), which we refer to as ACS3+8'. 



This is picturized in Fig. |6.1b[ Like ACS3+8, this chemistry also shows a reduction 
of Kg^, when ^3 exceeds its initiation threshold. We find that while at ks = the 
value of kI^ for the two chemistries is not too different (1.6 x 10"^ for ACS3+8' 
versus 1.8 x lO'^ for ACS3+8), at K3 = 141, k^^ reduces to a value 920 in ACS3+8', 
which is less than half of the value 2178 that it reduces to in ACS3+8. Thus a 
larger degree of overlap between the catalyzed reaction sets of nested ACSs causes 
more effective reinforcement. 

Another example with this behaviour for / = 2 is described in Fig. 6.4 In 



each of the three ACS pairs shown in the figure, the smaller ACS, of length 4, is 
the same, (it will be referred to as ACS (2,2)) and is defined by the reactions (each 
catalyzed by (2,2)) 



(0,1) + (1,0) 
(1,1) + (1,1) 



(2,2) 



\, 



XT 



(2,2) 



^^ 



^ 



(1,1) 

(2,2). 



(6.4a) 
(6.4b) 



The three larger ACSs, called ACS(5,3)(a), ACS(5,3)(b) and ACS(5,3)(c), respec- 
tively, can essentially be determined from the figure. For example, ACS(5,3)(a) 



consists of the two reaction pairs given by Eqs. (6.4), both catalyzed by (5,3) as 
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well as the three reactions 

(1.0) + 

(1.1) + 
(1,0) + 



ACS (5, 3) (b) consists of the single reaction pair given by the first of Eqs. (6.4) 
catalyzed by (5,3), as well as the four reactions 

(5,3) 



(1,0) + 
(1,0) + 

(1,1) + 
(1,1) + 

and ACS (5, 3) (c) consists of the 

(1,0) + 
(0,1) + 

(1.0) + 
(0,1) + 

(2.1) + 



2,2) 




(3,2) 


3,2) 


(5,3) X 


(4,3) 
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4,3) 


(5,3) ^ 


(5,3) 



(6.5a) 
(6.5b) 
(6.5c) 
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five reaction pairs 
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(6.6a) 
(6.6b) 
(6.6c) 
(6.6d) 

(6.7a) 
(6.7b) 
(6.7c) 
(6.7d) 
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The network topologies for three catalyzed chemistries are depicted pictorially 



in Fig. |6.5 

We consider the population dynamics of chemistries in which the spontaneous 
part includes all possible ligation and cleavage reactions involving molecules with 
upto A^ = 15 monomers with homogeneous rate constants kf = kr = 1, (f) = 15, 
and the catalyzed part containing one or more of the above mentioned ACSs. 
When ACS(2,2) is the only ACS present, the system shows bistability with the 
initiation threshold being ^[22) ~ ^51. When ACS(5,3)(a), (b) or (c) are the 
only ACSs present, the initiation thresholds for them are 1125197, 1031082, and 
1000112, respectively. When ACS(2,2) and one of ACS(5,3) (a), (b) or (c) are 
both present, and the catalytic strength of (2,2) is 552, the initiation thresholds 
of the three larger ACSs reduce to 941, 1256, and 2482, respectively. Again, it 
is seen that the larger the degree of overlap of the two nested ACSs, the more 
effective is the reinforcement. 
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Figure 6.5: Pictorial representation of nested ACSs with f — 2. Continued. 
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Figure 6.5: Pictorial representation of nested ACSs with / = 2. Each of the three 
figures shows two nested ACSs. The smaller ACS in each figure is ACS(2,2), defined by Eqs. 



(6.4). The larger ACS is different and is given by (a) ACS(5,3)(a), (b) ACS(5,3)(b), and, (c) 



ACS(5,3)(c), defined below Eqs. (6.4). The notation is the same as in Figs. 4.1 and 6.1 
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6.3 Summary 



In this chapter we have considered nested ACSs in a simple setting, wherein the 
catalyzed chemistry contains just two extremal ACSs, one larger than the other. 
The catalyst of the larger ACS is in the background of the smaller ACS, i.e., it 
can be produced from the product molecules of the smaller ACS via pathways 
of spontaneous reactions. We have shown that this reduces the catalytic strength 
required for the larger ACS to dominate by a substantial amount (by several order 
of magnitude in the examples considered). We have given an explanation for this, 
namely, that when the smaller ACS operates above its own initiation threshold, it 
boosts the concentrations of its background molecules including that of the larger 
catalyst by a large factor resulting in a corresponding decrease of the initiation 
threshold of the larger ACS. 
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Chapter 7 

Hierarchy of nested ACSs: A 
possible route for the appearance of 
large molecules 



THE process of nesting discussed in the previous chapter for two ACSs can be 
extended to multiple levels of ACSs connected to each other. Here we discuss 
sequences of ACSs of increasing size, with the catalyzed reaction set of each ACS 
in the sequence partially or completely contained within the next one, and the 
catalytic strength of molecules increasing with size in a controlled manner. We 
construct examples of such sequences in which large catalyst molecules containing 
several hundred monomers can acquire significant concentrations starting from 
the standard initial condition, even though all catalysts have moderate catalytic 
strengths. 

We construct a hierarchy of nested ACSs using two algorithms. In order to 
construct a cascade of nested ACSs in which reaction sets of smaller ACSs are 
completely contained in the larger ones (maximal overlap), we used Algorithm 
4 (described below). This algorithm produces a cascade of ACSs with g steps 
(generations), with the k^^ generation ACS containing n^ new reactions. We 
studied several catalyzed chemistries containing a cascade of nested ACSs for 
/ = 1 and 2. One example of each type is presented below; other examples 
gave qualitatively similar results. To construct ACSs with a variable degree of 
overlap, we used Algorithm 5. At each step A;, an extremal ACS of length Lk 
is added, generated using Algorithm 3. In Algorithm 5 the sequence of lengths, 
Li,L2, . . . ,Lg, is specified before running the algorithm. 
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molecules 

7.1 Generating a cascade of nested ACSs 

Algorithm 4: Incremental, random construction of a sequence of reac- 
tion sets with maximal overlap. We construct successive sets, or 'gener- 
ations', g in number, Pi,P2, . . . Pg, of product molecules, starting from the 
food set ?"(= Pq). Each generation P^ {k = l,2,...g) has a pre-specifed 
number of molecules, n^ (ni, n2, . . .Ug need to be specified before running 
the algorithm). At step k of the cascade an ACS Sk is constructed from the 
previous ACS Sk-i by adding reactions between molecules belonging to a re- 
actant set R^ consisting of all the products of the previous generations and 
the food set, Rk = Pq U Pi U P2 • • • Pfc-i- Let Lk denote the size of the largest 
molecule in Rk- At the beginning of the k^^ step, Pk is empty and the set of 
reactions in S^ is the same as in Sk-i, except that the catalysts of the reac- 
tions in Sk are not yet assigned. (5*0 is the empty set.) To construct Pk and 
Sk, pick a molecule X at random from the previous generation of products 
Pk-i and another molecule Y at random from Rk, and determine the prod- 
uct Z formed if they were to be ligated (X being the same as Y is allowed). 
If the size of this product, L{Z), is < Lk, discard the pair and choose an- 
other pair. If L{Z) > Lk, add the molecule Z to Pk and the ligation reaction 
X + Y — 7> Z to S'fc. Repeat this procedure until n^ molecules are added to 
Pk and Uk reactions are added to Sk- Assign catalysts to each reaction in Sk 
(which includes the reactions inherited from Sk^i and the new n^ reactions) 
randomly from P^. This completes the k^^ step. To get the full cascade with 
g generations this process is carried out for k = 1,2, . . . g. By construction 
the set of reactions (ignoring the catalyst) of each Sk is fully contained in 
that of Sk+i but the catalysts are different, being drawn from Pk for Sk and 
Pk+i for Sk+i- The union of the S'^'s constitutes the set of reactions in the 
catalyzed chemistry. Note that in this catalyzed chemistry reactions have 
multiple catalysts, the multiplicity declining for reactions producing higher 
generation molecules. 

The size of the food set constrains how large Uk can be. For / = 1, ni = 1 
and Pi = {A(2)} as the only product one can make from a reaction in the 
food set is A(l) + A(l) — )■ A(2), and n2 can only have values 1 and 2 as the 
only products one can produce in the second generation (from reactants in 
R2 = {A(l), A(2)}) are A(3) and A(4), etc. Similarly for f = 2, rii can be 
only 1,2 or 3, as reactions among the two food set molecules (1, 0) and (0, 1) 
can only produce three molecules (2, 0), (1, 1) and (0, 2). 



7.2. Dominance of an ACS of length 441 (ACS441) 

Algorithm 5: Incremental, random construction of a sequence of reac- 
tion sets with partial overlap. In this algorithm we decide on a sequence 
of increasing lengths, Li, L2, . . . , Lg, and generate an extremal ACS, denoted 
Si, of length Li using Algorithm 3 for each i = 1,2, . . . ,g. The union of the 
Si^s constitutes the set of reactions in the catalyzed chemistry. 

For each i a different random number seed is used to initialize Algorithm 
3. Changing the random number seed results in a different set of reactions 
defining the ACS. 



7.2 Dominance of an ACS of length 441 (ACS441) 

For / = 1 we describe a cascade with g = 15 and ni = l,n2 = n^ = ■ ■ ■ = nis = 2. 
This catalyzed chemistry had 29 product molecules, the largest of which was 
A(441) having 441 monomers. The list of molecules and reactions is given in 
Appendix |Fj The catalytic strength k, of each molecule was chosen by an explicit 
length dependent rule 

k{L) = KL^, (7.1) 

where K and /3 are constants. We describe a simulation with K = 500 and /3 = 1.5. 



This particular rule was chosen to contrast with Eq. (5.2) which characterizes the 



initiation threshold of an extremal ACS of length L. For a value of L such as 



441, the exponential function in Eq. (5.2) would have given an astronomically 



large catalytic strengtHH whereas the much slower growing power law in Eq. (7.1 ) 
gives k(441) = 4.6 x 10^ for the above mentioned values of the constants. Starting 
from the standard initial condition, the steady state concentration profile of this 
catalyzed chemistry embedded in a fully connected spontaneous chemistry with 
A^ = 800 is shown in Fig. 7.1a and for sparsely connected spontaneous chem- 
istry with k = 20 and 2 in Fig. |7.1b| and 7.1c respectively. As also seen for 



chemistries with a single ACS (Fig. 4.9, Section 4.4) the increase of sparseness 
causes the background concentrations to decline, while ACS molecules dominate 
more strongly. 



^Extrapolating Eq. (5.2) witli p = 0.64 as given below that equation would give k = 
gO. 64x441 _ 2 yg ^ lo^^^^ ^g mentioned in Section 5.1.2 p depends upon other parameters, 
including (j). 



The value p = 0.64 given in Section 5.1.2 is obtained ior (/) — I (see Fig. 5.1). 
In this section we discuss simulations with larger values of (f) for which p is large. Thus the 
corresponding values of k from Eq. ( |5.2[ ) (for extremal ACSs) would actually be significantly 
larger than 3.76 x lO^^^. 
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Figure 7.1: The dominance of a cascade of nested ACSs with a molecule of size 441 

(ACS441). Continued. 
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Figure 7.1: The dominance of a cascade of nested ACSs with a molecule of size 
441 (ACS441). The molecules and reactions of this ACS are listed in Appendix If] The red 
curves show the steady state concentration a;„ of all the molecules as a function of their size 
n, starting from the standard initial condition; blue dots show the concentrations of the ACS 
molecules. Insets show the same on a semi-log plot. It is evident that the large ACS molecules 
acquire a significant concentration. The catalytic strengths of the ACS molecules depend upon 
their size n according to K{n) — 500 x n}-^, and for all cases kj ^ kr = 1, cj) = 50, N — 800. The 
three figures differ in the level of sparseness of the spontaneous chemistry in which the ACS is 
embedded. The spontaneous chemistry in (a) is fully connected, in (b) has degree 20, and in 
(c) has degree 2. 
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molecules 

This example shows that with the nested ACS structure in the catalyzed chem- 
istry, large catalyst molecules can acquire significant concentrations starting from 
an initial condition containing only the monomers, even when catalytic strengths 
grow quite slowly with the length of the catalyst. It is worth noting that product of 
the catalytic strength of A(441) and its steady state concentration {x^i = 0.0077) 
is about 36000, and this is the factor by which it speeds up the reactions it cat- 
alyzes over the spontaneous rate. In view of the fact that enzymes containing 
a few hundred amino acids speed up the reactions they catalyze within cells by 
factors of about 10^ and greater, the catalytic efficiency demanded of A(441) does 
not seem unreasonably high. 



Eq. (7.1), which gives a particular functional form for k(L), is ad-hoc, and, 
at this stage, merely an example given to quantify the level of catalytic strengths 
that is sufficient for large molecules to arise in appreciable concentrations in the 
prebiotic scenario under consideration if chemistry has the nested ACS structure 
of the kind discussed. One may ask if an even weaker requirement would suffice. 
We have considered smaller values of /3 (1.2 and 1.0) keeping K fixed, and found 
that ACS molecules upto a particular size (depending on f3) do well but that the 
concentration of larger ACS molecules trails off and merges with the background 



(Fig. 7.2). The size range of ACS molecules that do well can be increased by in- 
creasing the coefficient K. Since the results depend upon several factors, including 
the topology of the ACS and the uncatalyzed chemistry, a detailed investigation 
has not been carried out. 

7.3 Cascading nested ACSs with / = 2 

An example of a nested ACS with two food sources, ACS(36,28), is presented 



in Fig. |7.3[ This is also generated by Algorithm 4 and has 7 generations with 
rifc = 3 for each generation, the largest molecule being (36,28) (the full list of 



molecules and reactions is given in Appendix G). Again starting from the standard 
initial condition the larger ACS molecules acquire appreciable concentrations with 
a moderate demand on their catalytic strengths. 



As a final example we present in Fig. 7.4 a cascade of ACSs, named ACS(18,27) 



after its largest molecule, in which smaller ACSs have only a partial overlap with 
longer ones. This is generated using Algorithm 5 and consists of a series of 10 
ACSs of increasing length. The detailed list of molecules, reactions and catalytic 
strengths is given in Appendix |HJ At each step, we use the random number seed 
to control the degree of overlap produced between successive generations of ACSs. 
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Figure 7.2: Dominance of larger molecules in ACS441 trails off for /3 = 1.2. This 



figure is for the same parameter values as Fig. 7.1b but with K{n) — 500 x n^-^. It can be seen 
that ACS molecules upto A(288) are able to dominate over the background (albeit with smaller 



concentrations than in Fig. 7.1b I while larger ones trail off and merge into background. 



At each step z, the random number seed was so chosen that the catalytic strength 
required for ACS Si to dominate (given all the ACSs of previous generations) is 
not too large compared to the one for ACS Si^i and, at the same time, each 
successive ACS in the cascade has only a few reactions in common with other 
ACSs. Unlike in the examples discussed above, generated by Algorithm 4, in the 
present case molecules (except the small molecules) produced in the catalyzed 
chemistry have typically only one or two catalyzed ligation reactions producing 
them. The chemistry also contains a number of catalyzed 'side reactions', which 
produce molecules that are neither catalysts nor reactants in any pathway leading 
to the largest molecule. In fact there is a 'side pathway' consisting of several 
reactions that may be viewed as 'draining the resources' of the main ACS. ACS 
dominance at moderate catalytic strengths occurs for this chemistry also. The 
largest n is 50000 for the molecule (18,27), and at a steady state population of 
0.26 enhances the rate of a reaction by a factor of 13000 over the spontaneous 
rate. This shows that the mechanism outlined by us is not restricted to maximally 
overlapping nested ACSs but is more generic. 
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Figure 7.3: Dominance of a cascade of nested ACSs with length 64 (ACS(36,28)) 
in a / = 2 chemistry, (a) 3D plot showing the steady state concentration a;„ of the molecule 
n — (rii, 712) as a function of ni and 712, starting from the standard initial condition. The colour 
coding is on a logarithmic scale of the concentration, (b) A 'top view' of the same so that the 
ACS molecules and background are more clearly distinguished. The colour coding here is on a 
linear scale of concentration. The food set and ACS molecules have the highest concentrations 
and stand out as black dots. The catalytic strengths of the ACS molecules depend upon their 
size L = rii + n2 according to n{L) — 500 x L^-^ , and kj = kj. — \, (p — IQ, N — 65. The 
spontaneous chemistry has degree 20. 
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Figure 7.4: Dominance of a cascade of partially overlapping nested ACSs 
(ACS(18,27)). Continued. 
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Figure 7.4: Dominance of a cascade of partially overlapping nested ACSs 
(ACS(18,27)). (a) Steady state concentration profile starting from the standard initial con- 
dition, (b) Top view of the same, (c) Sequence of steady state concentration profiles as each 



successive ACS is added to the chemistry. The legend for (a) is the same as for Fig. 7.3a and for 



(b) and (c) the same as Fig. 7.3b kf — kr — I, (j) — ^^t ^ — 50, and the spontaneous chemistry 
has degree 5. 
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Chapter 8 



Discussion and future outlook 



8.1 Summary 

To summarize, we have considered the population dynamics in a well stirred chem- 
ical reactor of an artificial chemistry in which large molecules can in principle 
be produced by successive ligations of pairs of smaller molecules, and where the 
populations of the smallest molecules, the 'food set' molecules or monomers, are 
buffered. The chemistry contains a large number of spontaneous reactions of which 
a small subset could be catalyzed by molecules produced in the chemistry. We 
have particularly considered the case where the set of catalyzed reactions contains 
autocatalytic sets. The main questions this thesis has been concerned with are: 

(a) under what circumstances do molecules belonging to the ACSs dominate in 
concentration over the rest of molecules in the chemistry (the background), and 

(b) starting from an initial condition that does not contain good catalysts, does 
the dynamics of such a chemical network permit a sparse set of large molecules 
(containing several tens or a few hundred monomers) that are good catalysts to 
arise and be maintained in the reactor at concentrations significantly above the 
background? 

Under (a) we confirm earlier results (Bagley and Farmer 159, 172|) that the 



mere existence of ACSs in the chemistry at the level of network topology is not suf- 
ficient to guarantee their domination at the population level, and extend them to 
quantitatively characterize the interplay between ACS topology and rate constants 
including catalytic strengths, dissipation rate, etc., that results in ACS domina- 
tion. In the absence of catalyzed reactions, and even the presence of an ACS if 
the catalytic strength k of catalysts is not large enough, the steady state con- 
centration profile in such a reactor shows an exponential decline with the length 



of the molecule (see, e.g.. Figs. 3.3, 3.5, 4.2a and Eqs. (3.4), (3.1)). In order 
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for ACS molecules to shine above the background, the catalytic strength needs 
to be sufficiently large. We show that a high dissipation rate together with a 
large catalytic strength contributes to ACS domination, the former contributing 
to keeping the background low and the latter to preferentially enhancing the ACS 



molecule concentrations (Fig. 4.2 and the discussion below Eqs. 4.3). We show 
that ACSs often produce a bistability in the dynamics (within a range of catalytic 
strength k^ < n < k^') with ACS molecule concentrations being appreciable and 
much higher than the background in one of the two stable fixed points (the 'high' 
fixed point), and being low and comparable to the background in the 'low' fixed 



point (Figs. 4.2, 4.6, 4.8). Bistability provides a crisp criterion for ACS domina- 
tion: If the reactor initially contains only the monomers - the relevant condition 
for the origin of life problem under consideration (or more generally, if the initial 
conditions are in the basin of attraction of the low fixed point), ACS molecules 
will dominate in the steady state only if k > n^^ . If the initial conditions are in the 
basin of attraction of the high fixed point (and this means that the concentration 
of the catalysts is already high), ACS molecules will dominate when k > k' . We 
also show that for extremal ACSs (in which the largest molecule is the catalyst 
of all the catalyzed reactions) the 'maintenance threshold' k,^ grows as a power 
of the size of the catalyst, while the 'initiation threshold' k^^ grows exponentially 



with the size of the catalyst (Fig. 5.1, Eq. (5.2)). This extends the results of 
Ohtsuki and Nowak 169 to a much larger class of models. Since k^^ characterizes 
the lower limit of catalytic strength needed to get to the 'high' fixed point starting 
from an initial condition containing only small molecules, the last result quantifies 
the difficulty for large molecules to dominate in a prebiotic scenario via the ACS 
mechanism, namely, the requirement of an unreasonably large catalytic strength. 
Under (b) we present a possible resolution to this problem via nested ACSs. We 
show that if an ACS catalyzed by large molecules contains within it (or partially 
overlaps with) a smaller ACS catalyzed by smaller molecules (we refer to this as 
a 'nested ACS' structure), the catalytic strength required for the large ACS to 



dominate comes down drastically (Fig. 6.2 and the associated discussion). Effec- 



tively the small ACS reinforces the larger one. We argue that this reinforcement 
(as measured by a reduced k^^ of the large ACS) has its roots in the population 
upliftment that the small ACS provides to its background which includes the large 
catalyst. One can construct a cascade of nested ACSs with each successive and 
larger ACS in the cascade lying in the background of the smaller ACSs, thereby 
receiving reinforcement from them and in turn providing reinforcement to the next 
larger ACS in the cascade. We use this mechanism to construct several examples of 
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cascaded nested ACSs with moderate catalytic strengths in which large molecules 
(containing upto several hundred monomers) are found to dominate significantly 



over the background starting from only monomers in the initial state(Figs. 7.1 



7.3[ 7.4 We show that even a k growing as slowly as a power of the catalyst size 



(Eq. (7.1)) is enough to get large molecule domination through the mechanism of 
nested ACSs. This is a significant improvement over the exponentially growing re- 
quirement for n in part ( a) . We construct several examples including one in which 
a large catalyst (with close to 450 monomers) arises with a dominant population 
in the steady state starting from initial conditions containing only the monomers. 
This catalyst has a reasonable catalytic strength compared to enzyme catalysts 
(in the steady state it enhances the rates of reactions that it catalyzes by a factor 
of less than 10^ over the spontaneous rate). 

Thus our work presents a possible resolution to the chicken and egg problem in 
chemistry posited by the existence of large molecules. In the context of the present 
model the appearance of large molecules is a natural dynamical consequence of 
chemistry possessing the structure described above - a cascading nested ACS 
structure (with a not very demanding set of catalytic strengths) embedded in 
a spontaneous chemistry - together with the buffered presence of the food set 
molecules in a well stirred region of space. The mechanism is an incremental one: 
to begin we require the small molecules that are built by spontaneous reactions 
from the food set to have weak catalytic properties, but after that we only require 
molecular properties {i.e., catalytic strength) that are an extension of existing 
molecular properties; at no point do we require a molecular property that is a 
radical departure from what already exists. Through the mechanism of sequential 
population support to successively larger (and stronger) catalysts from smaller 
(and weaker) ones, and the nonlinearity and feedback inherent in autocatalytic 
systems, the large catalysts effectively bootstrap themselves into existence. 

8.2 Possible applications of the model to prebiotic 
scenarios 

The kind of mathematical model we have studied, inspired by the work of Bagley 
and Farmer, is quite abstract; its virtue is the economy of assumptions that go 
into its structure. The main ingredients are that objects can combine with each 
other in processes or 'reactions' to form other objects and certain objects can facil- 
itate certain processes, i.e., 'catalyze' certain reactions. The population dynamics 
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implements a simple scheme for how the abundances of the objects would change 
with time assuming that the probability of objects combining is proportional to 
their abundances. Such a generic scheme while it applies in detail to no particular 
situation allows us to imagine mechanisms at a conceptual level. It is significant 
that in this scheme an ACS can direct the flows towards itself and cause a certain 
sparse subset of objects, including some specific large composite ones, to capture 
a large fraction of the chemical resources. 

At this level of abstraction the model (or a variant with qualitatively similar 
features) could apply to the peptide chemistry as well as an RNA chemistry and to 
a prebiotic metabolism, as already noted by Bagley and Farmer. Indeed it would 
be equally applicable if a prebiotic environment actually had a mixture of ingredi- 
ents from all these classes of chemistries, a possibility that has been advocated in. 



e.g., 182,183. Copley, Smith and Morowitz 182 have proposed a scenario which 
seeks to explain how the RNA world might have originated through a series of 
incremental steps starting from a primitive metabolism. The food sources for this 
supposed metabolism are CO2, H2, H2S, NH3, etc., in a hydrothermal vent. Their 
scenario envisages multiple stages of increasing complexity which they refer to as 
(i) the monomer stage, in which metabolism, possibly powered by an autocat- 
alytic set such as the reverse TCA cycle, produces nucleotides and simple amino 
acids, [a) the multimer stage, which produces dimers and small cofactors, {Hi) the 
micro-RNA stage, producing of oligonucleotides of length 3-10, (iv) the mini-RNA 
stage, with ll-40mers, followed by {v) the macro-RNA stage, or the RNA world. 
In their scenario each successive stage produced better catalysts that collectively 
catalyzed not only their own production from the molecules of the previous stage, 
but also the reactions of the lower stages. This structure is very similar to the 
cascade of nested ACSs that we have discussed. A suitably modified version of 
our model could be constructed to explore the dynamics of this scenario in more 
detail. At a general level, in the fact that the dynamics of our model results in 
the stable domination, or concentrating, of the large catalysts, our mathematical 
work perhaps lends support to the workability of such a scenario. 

There is another level at which the present model (or its variants) might talk 
to prebiotic chemistry. Morowitz [92] has suggested that the metabolic network 
itself has a shell like structure which can convert simple molecules like CO2, H2, 
NH3, etc., through "a hierarchy of nested reaction networks involving increasing 
complexity" into purines, pyrimidines, complex cofactors, etc. Reaction sets cre- 
ated by our Algorithms 4 and 5 are reminiscent of this picture. Missing from 
Morowitz's picture is a catalyst assignment for each reaction from among the 
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molecules in the various shells or from among other catalysts accessible prebioti- 
cally (e.g., surfaces in hydrothermal vents). It might be worthwhile to attempt to 
add in that information for a more complete scenario and for potential modeling. 

8.3 Caveats and future directions 

1. We have studied the properties of autocatalytic systems, by choosing spe- 
cific examples of ACS topology and special algorithms for constructing them. 
This has allowed us to systematically investigate the dynamical properties 
that ACSs offer. We believe that similar dynamical properties would hold for 
more general topological structures than we have considered. Nevertheless 
the question arises as to whether all these structures are very special struc- 
tures and whether or not they are likely to arise within real chemistry and 
'generic' artificial chemistries. ACSs have been shown to be quite generic in 



a large class of randomly constructed artificial chemistries 184 , 185 , and a 
similar analysis could be extended to cascading nested ACSs. This would 
help parametrize or characterize chemistries that would contain such struc- 
tures and those that would not. In this context it would useful to go beyond 
the simple case we have considered in which a molecule is defined by the 
number of monomers of each type and not their sequence. It may also be 
interesting to look for structures similar to nested ACSs in real metabolic 
networks using methods similar to those in |186|. 



2. An important related question is one of side reactions (discussed in 62 



187|) which might destroy the efficacy of ACSs. In the real chemistry one 



expects that even if cascading nested ACSs exist, there would also exist other 
catalyzed reactions channelizing the ACS products into pathways leading 
in other directions. Whether substantial populations of large molecules in 
the nested ACSs can be maintained in the presence of such diversion is a 
question that remains to be systematically investigated. Our last example 
of cascading nested ACSs in fact has several side reactions and it may be 
noted that large ACSs still dominated in that case. We remark that while 
side reactions can drain resources from ACSs, they also help the system to 
explore new directions in chemical space in an evolutionary scenario. 

3. We have considered deterministic dynamics in this paper. Stochastic fluc- 
tuations are important when molecular populations are small and can have 
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interesting effects on ACS dynamics 166,188 190 . For a chemistry con- 



taining multiple ACSs, Bagley, Farmer and Fontana 160 used stochasticity 
to produce examples of trajectories that differed from each other in the se- 
quence of ACSs that came to dominate the reactor. It would be interesting 
to explore such effects in the context of the present model. 

4. Another simplification we have made is that of homogeneous chemistries, 
wherein the rate constants of all spontaneous reactions have been taken to be 
the same, and even catalytic strengths, where variable, have been taken to be 
smooth functions of the length. We have also shown that introducing a small 
amount of heterogeneity or randomness in the rate constants does not change 
the qualitative behaviour significantly. However, the effect of cranking up the 
heterogeneity has not been studied. From studies of disordered systems in 
statistical mechanics and condensed matter physics it has become clear that 
such heterogeneity can lead to rugged landscapes, multiple attractors and 
timescales, and paths that are difficult to locate (191| . The dynamics of such 
systems when they are driven by a non-equilibrium flux or buffering of food 
set molecules is an open question. It is possible that the constraints placed 
by the ruggedness of the landscape will reduce the number of accessible 
paths. The populating of molecules at different levels in a nested hierarchy 
of ACSs is likely to happen in fits and starts on multiple timescales when 
heterogeneity is accounted for. It is perhaps in such a scenario that one 
should look for answers to the questions raised under (1), (2) and (3) above. 

5. The present model describes a population dynamics in a fixed environment. 
However the environment can change over a longer time scale leading to a 



complex evolutionary dynamics 163, 164 . We remark that in the context of 



the present model, the exogenous supply or depletion of certain molecules 
may affect the dynamics and allow the system to explore other steady states. 
For example, the system in the lower attractor of the bistable region may get 
elevated to the higher attractor if the exogenously supplied molecules can 
move the system from the basin of the lower attractor to that of the higher 
attractor. A sudden flux out of the system, may deplete the system of some 
molecules; this may cause a transition from the higher to the lower attractor. 
It may be interesting to study the effect of exogenously supplied molecules 
as well as of the sudden removal of molecules from the system and eventually 
incorporate an evolutionary scenario in which a more realistic setting for the 
origin of life could be investigated. 
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6. The model we have discussed has no spatial structures, i.e., the dynamics 
of the system happen in a homogeneous well stirred region of a prebiotic 
niche. One of the important steps in the origin of life was the appearance 
of enclosures. One can consider a scenario in which one or more of the 
molecules produced in our chemistry can spontaneously aggregate to form 
vesicles thereby providing an enclosure to the ACS dynamics. Enclosures 
such as vesicles can selectively allow exchange of molecules with the en- 
vironment, thereby allowing the system to access new steady states. An 
important body of work in this respect is the one that deals with the growth 



and evolution of protocells 110 114 116 125 170 192 193 . In our model we 



have also excluded self replicating molecules. One can also think of scenarios 
in which one of the molecules produced in the chemistry has the property 
of template replication. While systems of replicators including those where 



the replicator is catalyzed have been extensively studied 168 , 169 , 194 196 



The dynamics of systems with replication reactions as well as catalyzed 
non-replication reactions may be interesting in its own right. Further these 
systems may exist in the presence of enclosures. It will be interesting to 
integrate these different approaches. 
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Appendix A 

A^-independence of results at large 

N 



While the chemistry considered in this model is an infinite one, numerical ex- 
ploration requires us to work with a finite set of molecules that participate in 
the chemistry. This introduces an additional parameter N, the size of the largest 
molecule produced in the chemistry, in the model. Here we present evidence show- 
ing that certain important properties of the model become A^-independent at large 
enough A^. Specifically we show that the steady state profile of concentrations be- 
come independent for A^ for both uncatalyzed chemistries (discussed in Chapter 
[3]) and chemistries with ACS (discussed in Chapter |4]). 

A.l Uncatalyzed chemistry 

The steady state concentrations for an uncatalyzed chemistry can be approximated 
by an exponential function, x„ = ce"'''" = cA", where c and A = e""^ are constants 



(see Eq. (3.4)). We calculate A from the slope of the straight line fit on the plot 
of the log of the steady state concentrations, x„, versus length, n. We found that 
for sufficiently large A^, A becomes independent of A^; see Fig. |A.1[ 

When A < 1, the concentrations of the large molecules are small. It is evident 
that if the concentrations of the large molecules produced in the chemistry are so 
small that their contributions to the dynamics of the (relatively) smaller molecules 
is negligible, then, including molecules larger than those already considered in such 
a chemistry would cause no difference to the results. 
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Figure A.l: Dependence of A on N for uncatalyzed chemistries. A — kr ~ 1. We 

determine A at different values of N by fitting the steady state profiles to an exponential, 
Xn — ce~^" (excluding the concentrations xi to X4). We see a dependence of A on N for values 
of A^ < 50, but A becomes essentially independent of N for N > 100. 

A. 2 Catalyzed chemistry 

For chemistries with catalyzed reactions, including ACSs, we find that for suffi- 



ciently large N, the numerical results are independent of A^. In Fig. A.2| we show 
how the steady state concentrations in the chemistry that includes ACS65 (defined 



by Eq. (4.1)) depend upon A^. 



The results for the chemistry with / = 2 were also found to be A^-independent 
(for sufficiently large values of A^). We find that for / = 2 the A^-independence is 
reached at much smaller values of A^ than for / = 1. 
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A. 2. Catalyzed chemistry 




500 



Figure A. 2: Dependence of steady state concentrations on A^ for a chemistry that 
includes an ACS. The figure shows the steady state concentrations for the chemistry that 
includes ACS65 (Eq. (JO)) for N = 100, 200, 300, 500. In all cases A = kf = h 



1,' 



5. The 



steady state concentrations show an A^-dependence upto N = 100, but for N > 200 the profile 
becomes A^-independent. 
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Appendix A. A^-independence of results at large N 
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Appendix B 

Model run at fixed input rate of 
monomer instead of buffered 
concentration 

In Section [275] we have discussed the differences between our model and that stud- 



ied by Bagley and Farmer 158, 159, 172 . One of the differences is that of the 



boundary condition. In our model, we have considered the food set molecules to 
be buffered, i.e., their concentrations remain constant in time. Whereas, Bagley 
and Farmer consider a constant influx of the monomers; they consider the sys- 
tem as a chemical reactor in a which a fixed supply of monomers enter per unit 
time and a dilution flux takes away the products of the system. In this Appendix 
we discuss our model with a fixed input rate of monomers instead of buffered 
concentrations. 

Consider the case of a fully connected and homogenous spontaneous chemistry 
with / = 1. The rate equations for n = 2,3, . . . ,N (for n > N/2 the third term is 



absent) are given by (see Eq. (2.9)), 

(N-n) 

Xfi y ^ yrvjXiXj K'j.Xrfij ^ ^ yKjXiXrfi, nJ^^X^-j-nJ 

i<j,i+j=n i=l,i^n 

- 2 {kfxl - krX2n) " 4>Xn. (B.l) 

And for the monomer, 

(JV-l) 

Xi = 5 — y^ {kfXiXi — krXi^i) — 2 (kfxf — krX2) — 4>Xi, (B.2) 

where, 5, is the fixed rate at which the monomer enters the system. 
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If rrii is the molecular weight in atomic mass units (amu) of the monomer 
molecules and x„ the concentration of A{n) in moles per unit volume, the total 



mass of the system is given by M = Yln=i iTT'i^Xn- All the terms in Eqs. (B.l ) and 



(B.2) conserve mass except S, that corresponds to addition of new monomers to 
the system, and terms with 0, which models the loss of monomers (as monomers 
as well as larger molecules) from the system. Therefore, the rate of change of total 
mass is given by, 

M = 6mi- (j)M. (B.3) 

At steady state, the total mass of the system goes to a fixed point, M = -^. 
Thus fixing 6 and (p fixes the steady state value of M. 

In this system we are able to reproduce the result of ACS domination, which 
has been discussed for the buffered food set case in Chapter 111 For ACS65 (defined 



by Eqs. (4.1h) we are able to see that the molecules produced in ACS are able to 



dominate well above the background as was the case in the buffered input system. 



We even find bistability in the dynamics of the system as k changes (see Fig. B.l ). 
In the lower attractor, most of the mass is accumulated in the small molecules, 
whereas in the higher attractor the ACS molecules are able to dominate over the 



background (see Fig. B.2) 
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Figure B.l: Bistability in the dynamics of ACS65 for a fixed input rate of monomers. 

The steady state concentration of A(65) versus k for kf = kr = 1,6 = 5000, (/> = 100. The 
steady state mass in the system is M = 50. This figure was generated in same way as Fig. 
4. 6| starting from two different initial conditions, but with a fixed input rate instead of buffered 



concentration of the monomer (i.e., using Eq. (B.2| for the monomer concentration instead of 
xi = 1. In region I, both initial conditions lead to the same attractor in which the mass is largely 
accumulated in small molecules. Also in region III, the system has a single attractor in which 
the ACS molecules dominate over the background. The system exhibits bistability, i.e., it has 
two possible attractors, in region II. See Fig. |B.2| for the steady state concentration profiles of 
the system in the two attractors. 
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Appendix B. Model run at fixed input rate of monomer instead of buflfered 
concentration 
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Figure B.2: Steady state concentration profile for the system with fixed input rate 
of monomers. The steady state concentration Xn versus the length of molecule n is plotted 
for the system with fixed input rate of monomers, starting from two initial conditions. The red 
curve plots the steady state concentration profile for the attractor achieved from the standard 
initial conditions {xn = OV n — 1,2, . . . , N); blue curve plots the same for the attractor achieved 
from the high initial condition {xn = 1 V n). 
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Appendix C 

Comparison with the model of 
Ohtsuki and Nowak 



Ohtsuki and Nowak discussed a model of prelife catalysts 169 . They consider two 
type of activated monomers: 0* and 1*, produced spontaneously and abundantly 
in prebiotic chemistry. These monomers can deactivate to produce inactivated 
monomers, and 1. Activated monomers can combine with inactivated molecules 
to produce larger molecules, i + 0* — )■ iO and i + 1* — )■ il. It is assumed that 
the molecules only grow from one side by the addition of activated monomers. 
Therefore, each string i has a unique precursor, i', and two followers, denoted iO 
and zl {i.e., ii i = 010, iO = 0100). The dynamics of the system are given by, 

iji = aiHi' - {d + aio + aii)yi, i = 0,l, 00, 01, ... , (C.l) 

where i represent all binary strings, Xj is the concentration of sequence i and 
Oj is the rate constants for the reaction in which i is produced form i'. Ojo is 
the rate constant for the reaction i + 0* — )■ iO and an the rate constant for the 
reaction i + 1* — )■ il. It is assumed that the concentration of activated monomers 



is buffered, and the rate constants subsume their concentrations in Eq. (C.l). For 
production of deactivated monomers, and 1, via reactions 0* — )■ and 1* — )■ 1, the 
predecessors i' are 0* and 1* respectively, d is a loss rate at which the molecules are 
removed from the system. Each sequence of strings, obtained by repeated addition 
of Os or Is, is termed as a lineage, e.g., 0, 00, 000, . . ., or 1, 10, 101, 1010, . . ., etc. 
For the case of 'fully symmetric' prelife, assume that a^ = ai = A/2 and ai = a 
for all other sequences, i. In this case, all sequences of length n have the same 
steady state concentration and one may focus on a particular lineage, which for 
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Appendix C. Comparison with the model of Ohtsuki and Nowak 



convenience may be chosen to be 0, 00, 000, .... Between them the reactions are 

0* ^ (C.2a) 

(C.2b) 
(C.2c) 
(C.2d) 







+ 0* 


^00 






00 + 0* 


^000 




1 


000 + 0* 


^0000 


and the rate equations 


are given 


by, 






2/0 


A 

~2^°* " 


-{d + 2a)yo, 




yoo 


=a — {d 


+ 2a)|/oo, 




2/000 


=a-{d 


+ 2a)|/ooo, 



(C.3a) 
(C.3b) 
(C.3c) 



yo* is the concentration of 0*. Let a sequence of length k be denoted as 0^, k 
1,2,..., then one may write the equations as 

. A 



2/1 



:2/o- 



{d + 2a)yi, 



ijk =ayk-i + {d + 2a)yk, /c = 2, 3, . . . 



(C.4a) 
(C.4b) 



In their catalyzed prelife model, Ohtsuki and Nowak assume that some strings 
can catalyze certain reactions. For example, if sequence j is a catalyst for the 
reaction i + 0* — > iO, it enhances the rate of the reaction from Ojo {dio + cXj), where 
ttio is the spontaneous reaction rate and c is the strength at which j catalyzes the 
reaction. They call a 'perfect prelife catalyst' as one which catalyzes all the ligation 
reaction in its own lineage. In fully symmetric prelife with a perfect catalyst of 
length 5 we can write following reactions 

(C.5a) 

(C.5b) 

(C.5c) 

(C.5d) 

(C.5e) 
(C.5f) 
(C.5g) 

(C.5h) 



0* 




-> 





+ 0* 


00000 


-> 


00 


00 + 0* 


00000 


-> 


000 


000 + 0* 


00000 


-> 


0000 


0000 + 0* 


00000 


-^ 


00000 


00000 + 0* 




-> 


000000 


000000 + 0* 




-^ 


0000000 
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C.l. Reducing our / = 1 model to Ohtsuki and Nowak model 



The rate equations for the system described by Eqs. (C.5) are given by, 

yi=^yo* - id + a+{a + cy5))yi, (C.6a) 

?/2 =(a + cyr,)yi + {d + a + {a + cy^o))y2, (C.6b) 

?/3 =(a + cy^)y2 + {d + a + {a + cy^))yz, (C.6c) 

?/4 =(a + c?/5)|/3 + (d + a + (a + c|/5))2/4, (C.6d) 

?/5 =(a + c?/5)y4 + {d + 2a)|/5, (C.6e) 

2/6 =a2/5 + {d + 2a)y6, (C.6f ) 

2/7 =^2/6 + id + 2a)?/7, (C.6g) 

; (C.6h) 



C.l Reducing our / = 1 model to Ohtsuki and Nowak 
model 



Consider the following forward reactions in our model, 

A(l) + A(l) ^ A(2) 

A(l) + A(2) ^ A(3) 

A(l) + A(3) ^ A(4) 

A(l) + A(4) -^ A(5) 



(C.7a) 
(C.7b) 
(C.7c) 
(C.7d) 



All other reactions are omitted from the chemistry by setting their respective 
reaction rate constants equal to zero, kf, = for all i ^ l,j, kfj = for all i,j 



pairs. Thus from Eq. (2.6) we get 



Xr 



-fci(„_l)3;iXn_i fc-|^^XiXn (pr 



•^725 '^ Z, O, . . . 



(C.^ 



We now make following relabeling of species in Ohtsuki and Nowak model to 
compare it with our model, O'^ is now labeled as A{k + 1) and 0*, the buffered 
activated monomer in the system, is now labeled A(l). Explicitly, 

0*^ A(l), 

Oo A(2), 

00 o A(3), 

000 ^ A(4), 



115 



Comparing Eqs. (C.8) and (C.4) we get, k^^Xi = A/2, fc^ = a for all n > 2, 



and 



= {d + a) for all n. Note, reaction (C.7a) is a bimolecular reaction, where 
as reaction (C.2a) is a unimolecular reaction, but as xi{= A) is a constant it 
can be absorbed in the rate constant. For a catalysed reaction, nk^ = c. (Note 
that Ohtsuki and Nowak only consider spontaneous transformation of activated 



monomers into deactivated ones, thus reaction (C.7a) is not catalysed. 



With these transformations our model reduces exactly to the fully symmetric 
prelife model with catalysis given by Ohtsuki and Nowak. All their analytical 
results also apply to this (reduced) model. Numerically we have been able to 
reproduce all their results. An example of steady state concentration profile for a 
chemistry containing the following set of catalyzed reactions is shown in Fig. |C.l 



A(l) + A(2) 
A(l) + A(3) 
A(l) + A(4) 
A(l) + A(5) 



A(6) 



A(6) 



^ A(3) 



-^('^ A(4) 
-^(^^ A(5) 



■> A(6) 



(C.lOa) 
(C.lOb) 
(C.lOc) 
(C.lOd) 



In this reduced form of the model only the largest molecule produced in catalyzed 
reactions dominates the population. 
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C.l. Reducing our / = 1 model to Ohtsuki and Nowak model 
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Figure C.l: Steady state concentration profile for reduced model. The plot shows the 
steady state concentrations a;„ versus length of molecule n for the catalyzed chemistry defined 
by Eqs. (C.fO). A(l) or the activated monomer 0* has a buffered concentration fixed at 1. 



Xn,n > 2 gives the population of molecule A{n) that models the member of the hneage with 
length n — 1. A(6), which is the largest molecule produced in the catalysed set dominated the 
population. Parameters take following values ^ = 1, fcfi(= A/2) — 1/2, k[j = l,(p = 2, k = 500. 
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Appendix C. Comparison with the model of Ohtsuki and Nowak 
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Appendix D 

The origin of multistability in ACS65 



In this appendix we attempt to give an intuitive understanding of why ACS65 has 
a muhistabihty in its dynamics, in particular the existence of the third attractor 



with intermediate concentrations shown in Fig. 4.7, The reason rests on the 



substructure of ACS65. Notice that any catalyzed chemistries defined by a subset 



of the reactions given in Eqs. 4.1 (that define ACS65) is not an ACS. Thus our 



discussion naturally leads to non-ACS catalyzed chemistries. 

D.l Catalyzed chemistries not containing an ACS 

A particularly interesting case of catalyzed chemistries that do not contain an 
ACS arises when the set of catalyzed reactions form unbroken pathways from the 
food set to higher molecules, but the catalysts are not drawn from the product 
set of these reactions {i.e., the catalyzed chemistry satisfies condition (b) for an 



ACS but not the condition (a); see Section 4.1). In this case we again observe the 
domination of the molecules produced by the catalyzed pathways, but this occurs 
at even higher catalytic strengths than for the ACSs that include production 
for these catalysts. The reasons are similar to those discussed above in Section 



4.2.1 in the context of the k — )■ oo limit for ACSs (note that in that limit the 
identity of the catalyst is irrelevant for the argument as long as the catalyst has 
a nonzero concentration). We have also observed bistability in these chemistries 
(an example is discussed below) as well as a dependence on similar to the ACS 
case. In general, catalyzed chemistries (with and without ACSs) seem to have a 
rich and complex phase structure but more comprehensive investigations than we 
have done are needed to nail down the possible range of interesting behaviours. 

Consider the following set of catalyzed reactions along with full spontaneous 
chemistry. Notice that there is a continuous path from the food set to A(4). And 
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Appendix D. The origin of multistability in ACS65 

the catalysts, namely, A (9) and A (5), are not produced in the catalyzed reactions 
themselves. This chemistry is in fact a subset of ACS65, containing the first two 
catalyzed reactions of the set. 

(D.la) 



A(l) + A(l) ^— ^ 


A(2) 


A(2) + A(2) ^^^^ 


A(4) 



(D.lb) 
The steady state concentration profile for this chemistry which we refer to as 



C9 (as A(9) is the largest catalyst in this chemistry) is shown in Fig. D.la, As for 
an ACS, the molecules produced in the catalyzed reactions, viz., A (2) and A (4) 
have much larger concentrations as compared to other species in the chemistry. 



Fig. D.lb shows bistability in the steady state concentrations as k is varied. 
Note that the initiation threshold happens to be the same as for ACS65 namely 
K^^ = 2226392, but the maintenance threshold is now considerably higher. 

D.2 Reasons for multistability in dynamics of ACS65 

The above example shows that bistability can arise even in a simple catalyzed 
chemistry that has continuous catalyzed pathways originating from the food source. 
This is in fact the origin of multistability in dynamics of ACS65. Consider the 



case of ACS65 defined in Eqs. 4.1, but with reaction (4.1c) deleted. Note that 
the resulting set of 7 catalyzed reaction pairs is no more an ACS, but merely a 
catalyzed chemistry which we refer to as C65. The continuous catalyzed pathways 
end at A(4) just as in C9 above. The catalyzed chemistry C65 with 7 catalyzed 



reaction pairs also shows a bistability (Fig. D.2a). The steady state concentration 



profile for the system in the upper attractor is shown in Fig. |D.2b 



The main point we wish to draw attention to is that the three catalyzed 
chemistries C9, C65 and ACS65, all have been found empirically to have bistability 
with the same value of k^^ = 2226342. Since C9 is the smallest catalyzed chemistry 
that is common to all three (note that as sets of reactions, C9 C C65 C ACS65), 
it is reasonable to suppose that it is the structure of C9 that is determining the 
value of K,^^ for all three. That is, the additional catalyzed reactions in C65 and 
ACS65 do not seem to matter as far as the value of k,^^ is concerned. 



Notice that the catalyst for reaction (4.1c) in ACS65 is A(28). As A(28) a is 



considerably large molecule, its concentration in the lower attractor is very small; 
the product k, x a;28 is thus much smaller than 1. That is, the rate of the catalyzed 



reaction (4.1c) is much less than the corresponding spontaneous reaction. For 
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D.2. Reasons for multistability in dynamics of ACS65 
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Figure D.l: Steady state concentrations and bistability of a catalyzed chemistry. 



(a) The steady state concentration profile for the catalyzed chemistry (D.l I. A = kf = kr = 
l,<j> — 15,K = 5x 10^. A(2) and A(4) have higher concentrations compared to other molecules in 
the background, (b) The hysteresis curve showing bistability in the dynamics of this catalyzed 
chemistry without an ACS. The parameters take same values as in (a), except k, which is varied. 
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this reason, it can be assumed that the catalyzed reaction (4.1c) is absent in the 



range where k x X28 remains much less than 1 (see Section 4.2.1 and also 5.1.3). 
Therefore ACS65 in this range also exhibits the same behaviour as C65. 



Now C65 has an upper attractor given by the upper red line in Fig. D.2a 



Therefore ACS65 also acquires this steady state. This adds an additional stable 
state, inducing multistability, in the dynamics of ACS65. The multistability curve 



for ACS65 is also shown in Fig. D.2a where the intermediate steady state of 
ACS65, overlaps with higher steady state of C65. The steady state profiles of C65 
in its higher attractor and ACS65 in its intermediate attractor, shown in Figs. 



D.2b and 4.7b are essentially indistinguishable. 
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D.2. Reasons for multistability in dynamics of ACS65 
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Figure D.2: Bistability in the dynamics of the catalyzed chem C65, which is the 



same as ACS65, but with the catalyzed reaction (4.1c) deleted, (a) The hysteresis 



curve for the catalyzed chemistry C65. For comparison the phase portrait for ACS65 is also 
shown in dotted line. The intermediate steady state in ACS65 corresponds to the upper steady 
state of C65. The parameters are kf = k,- = A = l,(p = 15,N = 100 (b) The steady state 
concentration profile in upper steady state, k = 5 x 10^ other parameters take same values as 



in (a). Note that this profile is essentially the same as that for ACS65 shown in Fig. 4.7b 
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Appendix E 

Effect of ACS topology on bistability 



In Section 4^ we have discussed how bistabihty arises in simple ACSs. We extend 
the discussion to include how the assignment of catalysts to the reactions of an 
ACS affects the dynamics, specifically the existence and extent of bistability. The 
assignment of the catalysts to the reactions of the ACS is an aspect of the topology 
of the ACS. ACSs with the same set of reactions (and hence the same product set 
P{S)) may differ from each other in the assignment of catalysts to the reactions. 
For example, consider following set of reactions comprising an ACS, 

^ ' "^^ (E.la) 



A(l) + A(l) 




A(2) 


A(2) + A(2) 


My) \ 


A(4) 


\ 


A(2) + A(2) 




A(8) 



(E.lb) 
(E.lc) 

where, x,y,z G {2,4,8}. This ACS has 27 possible topologies of catalyzed reac- 
tions, enumerated by the values of x, y, z. The list of topologies and the k^ and 
K^^ values for each topology assuming a homogeneous catalyzed chemistry {i.e., 



all catalysts have the same catalytic strengths) are listed in Table E.l For some 
cases, such as whenever x = 2, i.e., A(2) is the catalyst of the reaction (E.la), 
we do not find bistability in the dynamics of the ACS (irrespective of the values 
y and z take) for the value of (p studied; recall from the discussion in the Section 



4.3 that there exists a critical length and a below which we do not see bista- 
bility. The bistability may exist for some topologies for a higher value of 0, e.g., 
x = 2,y = 8,z = 8has a bistability at = 25 with k^ = 481.5 and k^^ = 2807.4. 
In general, the parameter region where bistability exists in the dynamics depends 
upon X, y and z. We illustrate this in the table below for a homogeneous chemistry. 
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Table E.l: List of all the possible topologies of ACS given by Eq. (E.l) 



and respective k^ and k^^ values. A{x),A{y),A{z) are the catalysts for the 



reactions (E.lal, (E.lbl, (E.lc) respectively. The parameters used to calculate k^ 



and K^^ are A — k 



f 



X 


y 


z 


«;^ 


k'' 


2 


- 


- 


Bistability does not exists for = 5 


4 


2 


2 


4 


2 


4 


4 


2 


8 


4 


4 


2 


6.558503e+01 


6.677801e+01 


4 


4 


4 


6.332721e+01 


6.554611e+01 


4 


4 


8 


6.101660e+01 


6.520190e+01 


4 


8 


2 


1.449191e+02 


1.504237e+02 


4 


8 


4 


1.498893e+02 


1.594422e+02 


4 


8 


8 


1.713392e+02 


2.072384e+02 


8 


2 


2 


2.795060e+02 


8.610626e+02 


8 


2 


4 


2.862641e+02 


1.026008e+03 


8 


2 


8 


3.140126e+02 


3.388361e+03 


8 


4 


2 


2.927453e+02 


1.053617e+03 


8 


4 


4 


2.991615e+02 


1.223987e+03 


8 


4 


8 


3.251521e+02 


3.564402e+03 


8 


8 


2 


3.487341e+02 


5.303906e+03 


8 


8 


4 


3.553727e+02 


5.791569e+03 


8 


8 


8 


3.786843e+02 


1.194718e+04 



126 



Appendix F 

List of reactions and their catalysts 
in ACS441 

The table lists all the reactions with their respective catalysts in the example of a 



catalyzed chemistry, discussed in Section 7.2, containing a cascade of nested ACSs 



for / = 1 generated using Algorithm 4. The steady state concentrations for this 



chemistry are displayed in Fig. 7.1 This chemistry was generated with g = 15 
and rii = 1, n2 = ns = . . . = nis = 2. 

Note that a general molecule for the / = 1 case, represented by A{n) in the 
main text, has been represented here by n for brevity. 

The molecules in various generations are as follows: Pq = {1}, Pi = {2}, P2 = 
{3,4}, P3 = {6,7}, P4 = {10,12}, P5 = {17,18}, Pe = {20,24}, P7 = {25,48}, 
Pg = {52,66}, P9 = {67,69}, Pio = {77,84}, Pn = {144,168}, Pu = {221,288}, 
P-^3 = {305,336}, Pi4 = {372,389}, P15 = {397,441}. 

The catalyst for a reaction listed under generation Pk is added at step k of 
algorithm. It is apparent from the reaction table that the ACSs are maximally 
overlapping, i.e., any ACS of generation k contains all the reactions of generation 
k-1. 
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Appendix F. List of reactions and their catalysts in ACS441 
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Appendix F. List of reactions and their catalysts in ACS441 



130 



Appendix G 

List of reactions and their catalysts 
in ACS(36,28) 

The table lists all the reactions with their respective catalysts in the example of a 



catalyzed chemistry, discussed in Section 7.3, containing a cascade of nested ACSs 



for / = 2 generated using Algorithm 4. The steady state concentrations for this 



chemistry are displayed in Fig. 7.3, This chemistry was generated with g = 7 and 
rik = 3. 

The molecules in various generations are as follows: Pq = {(1, 0), (0, 1)}, Pi = 
{(1,1), (0,2), (2,0)}, P2 = {(1,3), (2, 2), (3,0)}, P, = {(2, 6), (2, 3), (5, 2)}, P4 = 
{(4, 6), (7, 4), (7, 8)}, P, = {(8, 12), (6, 12), (14, 8)}, P, = {(14, 24), (22, 20), (11, 12)}, 
P7 = {(29, 28), (36, 28), (24, 26)}. 

The catalyst for a reaction listed under generation Pk is added at step k of 
algorithm. It is apparent from the reaction table that the ACSs are maximally 
overlapping, i.e., any ACS of generation k contains all the reactions of generation 
k-1. 
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Appendix H 

List of reactions and their catalysts 
in ACS(18,27) 

The table lists all the reactions with their respective catalysts and catalytic strengths 



in the example of a catalyzed chemistry, discussed in Section 7^, containing a cas- 
cade of partially overlapping ACSs for / = 2 generated using Algorithm 5. The 



steady state concentrations for this chemistry are displayed in Fig. 7.4 



The catalyzed chemistry contains 10 generations of ACSs of lengths 3, 6, 10, 
15, 19, 25, 30, 35, 40, and 45. 



Table H.l: List of reactions in the ACSs of different length and their cat- 
alysts. ACSs of increasing length (using Algorithm 5) are added in the chemistry. 
The length of the ACS and the catalytic strength of the catalyst are mentioned in 
the table. 



Reaction 


Catalyst 


Generation 1 of length 3 (k = 


= 1000) 


(1,0) + (1,0) ^(2,0) 


(3,0) 


(2,0) + (1,0) ^(3,0) 


(3,0) 


Generation 2 of length 6 (k = 


= 2000) 


(1,0) + (1,0) -(2,0) 


(4,2) 


(0,1) + (2,0) -(2,1) 


(4,2) 


(2,1) + (0,1) ^(2, 2) 


(4,2) 


(2,1) + (2, 1)^(4, 2) 


(4,2) 



continued on next page . . . 
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Table H.l 


. . . continued from, previous page 


Reaction 


Catalyst 


Generation 3 of length 10 (k = 4000) 


(1,0) + (0,1) ^(1,1) 


(4,6) 


(1,1) + (1,0) ^(2,1) 


(4,6) 


(1,1) + (1,1) ^(2, 2) 


(4,6) 


(0,1) + (2, 2) ^(2, 3) 


(4,6) 


(2, 3) + (2, 3) ^(4, 6) 


(4,6) 


Generation 4 of length 15 (k = 7000) 


(0,1) + (0,1) ^(0,2) 


(7,8) 


(0,2) + (1,0) ^(1,2) 


(7,8) 


(1,2) + (1,0) ^(2,2) 


(7,8) 


(2, 2) + (2, 2) ^(4, 4) 


(7,8) 


(1,2) + (4,4) ^(5,6) 


(7,8) 


(5,6) + (1,2) ^(6,8) 


(7,8) 


(5, 6) + (2, 2) ^(7, 8) 


(7,8) 


Generation 5 of length 19 (k = 10000) 


(0,1) + (1,0) ^(1,1) 


(7,12) 


(1,1) + (0,1) ^(1,2) 


(7,12) 


(1,2) + (1,2) ^(2,4) 


(7,12) 


(1,1) + (2,4) ^(3,5) 


(7,12) 


(2, 4) + (3, 5) ^(5, 9) 


(7,12) 


(1,1) + (5, 9) ^(6, 10) 


(7,12) 


(1,2) + (6, 10) ^(7, 12) 


(7,12) 


Generation 6 of length 25 (k = 15000) 


(0,1) + (1,0) ^(1,1) 


(12,13) 


(1,1) + (1,0) ^(2,1) 


(12,13) 


(2,1) + (0,1) ^(2, 2) 


(12,13) 


(2, 2) + (0,1) ^(2, 3) 


(12,13) 



continued on next page . 
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Table H.l 


. . . continued from previous page 


Reaction 


Catalyst 


(1,0) + (2, 3) ^(3, 3) 


(12,13) 


(2,1) + (3,3) ^(5,4) 


(12,13) 


(2, 3) + (5, 4) ^(7, 7) 


(12,13) 


(7, 7) + (2, 2) ^(9, 9) 


(12,13) 


(7, 7) + (3, 3) ^(10, 10) 


(12,13) 


(9,9) + (2,3) ^(11, 12) 


(12,13) 


(11, 12) + (1,1) ^(12, 13) 


(12,13) 


Generation 7 of length 30 (k = 20000) 


(0,1) + (1,0) ^(1,1) 


(13,17) 


(1,1) + (0,1) ^(1,2) 


(13,17) 


(1,2) + (1,1) ^(2, 3) 


(13,17) 


(1,0) + (2, 3) ^(3, 3) 


(13, 17) 


(3,3) + (1,1) ^(4,4) 


(13,17) 


(3,3) + (1,2) ^(4,5) 


(13,17) 


(2, 3) + (4, 5) ^(6, 8) 


(13,17) 


(2, 3) + (6, 8) ^(8, 11) 


(13, 17) 


(6, 8) + (4, 4) ^(10, 12) 


(13,17) 


(10, 12) + (2, 3) ^(12, 15) 


(13,17) 


(12, 15) + (1,2) ^(13, 17) 


(13,17) 


Generation 8 of length 35 {k = 27000) 


(1,0) + (0,1) ^(1,1) 


(15,20) 


(0,1) + (1,1) ^(1,2) 


(15,20) 


(1,1) + (1,2) ^(2, 3) 


(15,20) 


(1,2) + (2, 3) ^(3, 5) 


(15,20) 


(3, 5) + (1,1) ^(4, 6) 


(15,20) 


(1,1) + (4, 6) ^(5, 7) 


(15,20) 


(3, 5) + (4, 6) ^(7, 11) 


(15,20) 



continued on next page . . 
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Table H.l 


. . . continued from, previous page 


Reaction 


Catalyst 


(7, 11) + (4, 6) ^(11, 17) 


(15,20) 


(1,0) + (11, 17) ^(12, 17) 


(15,20) 


(1,1) + (12, 17) ^(13, 18) 


(15,20) 


(13,18) + (1,2) ^(14,20) 


(15,20) 


(1,0) + (14,20) ^(15,20) 


(15,20) 


Generation 9 of length 40 (k = 35000) 


(0,1) + (0,1) ^(0,2) 


(14,26) 


(1,0) + (0,2) ^(1,2) 


(14,26) 


(1,2) + (1,2) ^(2,4) 


(14,26) 


(1,2) + (2,4) ^(3,6) 


(14,26) 


(3,6) + (1,0) ^(4,6) 


(14,26) 


(3, 6) + (2, 4) ^(5, 10) 


(14,26) 


(5, 10) + (1,2) ^(6, 12) 


(14,26) 


(6, 12) + (2, 4) ^(8, 16) 


(14,26) 


(2,4) + (8,16) ^(10,20) 


(14,26) 


(0,1) + (10,20) ^(10,21) 


(14,26) 


(10,21) + (0,2) ^(10,23) 


(14,26) 


(0,2) + (10, 23) ^ (10,25) 


(14,26) 


(10,23) + (1,2) ^(11,25) 


(14,26) 


(8, 16) + (5, 10) ^ (13,26) 


(14,26) 


(4,6) + (10,20) ^(14,26) 


(14,26) 


Generation 10 of length 45 (k = 50000) 


(0,1) + (1,0) ^(1,1) 


(18,27) 


(0,1) + (1,1) ^(1,2) 


(18,27) 


(1,2) + (1,0) ^(2,2) 


(18,27) 


(1,2) + (1,1) ^(2, 3) 


(18,27) 


(1,2) + (2,2) ^(3,4) 


(18,27) 



continued on next page . . . 
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Table H.l 


. . . continued from previous page 


Reaction 


Catalyst 


(2, 3) + (2, 3) ^(4, 6) 


(18,27) 


(4, 6) + (1,1) ^(5, 7) 


(18,27) 


(4, 6) + (3, 4) ^(7, 10) 


(18,27) 


(7, 10) + (7, 10) ^ (14,20) 


(18,27) 


(0,1) + (14,20) ^(14,21) 


(18,27) 


(14,21) + (2,2) ^(16,23) 


(18,27) 


(2, 3) + (16, 23) ^ (18,26) 


(18,27) 


(14,21) + (4,6) ^(18,27) 


(18,27) 
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Appendix I 

Description of the programs used to 
generate the results in the thesis. 



This appendix discusses the programs used to generate various numerical results 
in the thesis. The code source files can be downloaded from the website: 
http : //sites . google . com/sit e/varungiri/research/phd-thesisi 



The numerical work reported in this thesis was largely done using C codes. For 
the numerical integration of differential equations these codes use CVODE solver 



library of the SUNDIALS package v. 2. 3.0 174 which can be downloaded from 



https : //computation . llnl . gov/casc/sundials/download/download . html (Novem 



ber 2006 release version). The list of programs is as follows: 

Chapter 3 

• spontaneous-f 1 . c, spontaneous-f 2.c : Constructs a fully connected spon- 
taneous chemistry for / = 1, 2 (respectively) and integrates the system of 
differential equations to find steady state concentrations. This program pro- 
duces the time courses and the steady state concentration values for a class 
of user specified initial conditions. 

• spontaneous-sparse-f 1 .c, spontaneous-sparse-f 2.c : Constructs a sparse 
spontaneous chemistry with degree k specified as a parameter to the pro- 
gram for / = 1, 2 (respectively) and produces the steady state concentration 
values. 

• spontaneous-hetro-f 1 . c : Constructs a fully connected spontaneous chem- 
istry. It then assigns the rate constants for each reaction as described in 
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Section 3.4 (for heterogeneous chemistries) and computes the steady state 



concentration values. 



Chapter 4 



acs-f 1 . c : Constructs the fully connected spontaneous chemistry for / 



with catalyzed reactions defined by ACS65 (Eqs. (4.1)) and computes the 
steady state concentrations. 

acs-f 2 . c : Constructs the fully connected spontaneous chemistry for / = 2 



with catalyzed reactions defined by ACS(8,10) (Eqs. (4.2)) and computes 
the steady state concentrations. 



ACS-2-4.ode : Input file for XPPAUT for ACS4 (Eqs. (4.4)). 



acs-sparse-f 1 .c : Constructs a sparse spontaneous chemistry for / = 1 
with catalyzed reactions defined by ACS65 (Eqs. ( |4.1 )) and computes the 
steady state concentrations. 

acs-hetro-f 1 . c : Constructs a fully connected spontaneous chemistry and 
assign rate constants for each reaction as described in Section [3^ It cat- 
alyzes the reactions as defined by ACS65 (Eqs. ( |4.1 )) and computes steady 
state concentration values. 



These programs can be appropriately changed to study other ACSs. The instruc- 
tions to do so are included with the code. 



Chapter 5 



algol.c, algo2.c, algoS.c : Constructs a fully connected spontaneous 
chemistry for / = 1 with an ACS of length L constructed using Algorithm 
1, 2, or 3 (respectively) described in Section 5.1.1 



Chapter 6 



acs-f l-ACS3_8.c : Constructs a fully spontaneous chemistry for / 



with a pair of nested ACSs defined by Eqs. (6.1) and (6.2) and computes 
the steady state concentration values. 

acs-f 2-ACS4_8.c : Constructs a fully spontaneous chemistry for / = 2 



with a pair of nested ACSs defined by Eqs. (6.4) and (6.7) and computes 
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the steady state concentration values. The ACSs defined in this program 
can be modified to study the dynamics of the other ACSs mentioned in the 



text below Eq. (6.4) 



Chapter 7 



algo4-lf .c, algo4-2f . c : Constructs a sparse spontaneous chemistry for 
/ = 1,2 (respectively) with nested ACSs constructed using Algorithm 4 



described in Section 7.1, and computes steady state concentration values. 



algo5-2f . c : Constructs a sparse spontaneous chemistry for / = 2 with 



nested ACSs constructed using Algorithm 5 described in Section |7.1[ and 
computes steady state concentration values. 
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The definitions given here are for quick reference. For precise definitions see text. 

ACS Autocatalytic set. 49 
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lyst of this chemistry gets produced by a reaction of the chemistry, and (b) 
every reactant in the chemistry is either a member of the food set 5" or gets 
produced in this chemistry. 49 

background The set of all molecules except the ACS product molecules and the 
food set. 50 

catalytic strength The factor by which a catalyst enhances the rate of the re- 
action it catalyzes, denoted k. 23 

catalyzed chemistry The set of catalyzed reactions together with the catalysts 
and their catalytic strengths. 24 

connected chemistry A chemistry in which every molecule can be produced 
from the food set in some pathway consisting of a sequence of allowed reac- 
tions. 23 

fully connected chemistry A chemistry in which all possible ligation and cleav- 
age reactions are allowed: /c^y ¥" ^"^^ ^xy t^ ^^^^ ^^^ X,Y. 23, 33 

homogeneous chemistry A chemistry in which all the nonzero rate constants, 
loss rates, and concentration of the food set molecules are independent of 
the species labels. 23, 25, 29, 33 

homogeneous catalyzed chemistry A catalyzed chemistry in which the cat- 
alytic strength of every catalyst for each reaction that it catalyzes is the 
same. 24 
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reversible chemistry A chemistry for which each allowed reaction is reversible, 
i.e., for every reaction, k^ ^ ^^ k^ ^ 0. 23 

sparse chemistry A chemistry that contains only a small fraction of all possible 
reactions. 23, 33, 41 

spontaneous chemistry The set of possible reactions, defined by non-zero rate 
constant kxy and kxy 22, 23, 29, 33 

extremal ACS An ACS of length L is 'extremal' if all reactions are catalyzed 
by the largest molecule (of size L) produced in the ACS. 69, see length of 
ACS 

finite chemistry A chemistry in which all reactions that produce or consume 
any molecule of length larger than some positive integer A^ are omitted. 24, 

25, 27 

food set A set of small molecular species that are presumed to be abundantly 
present in a prebiotic niche, denoted 5". 21 

length of ACS The largest molecule produced in the ACS, denoted L. 69, see 
ACS 

standard initial condition Initial condition in which all concentrations other 
than the food set are zero. 34, 37 
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